[ https://issues.apache.org/jira/browse/LUCENE-3892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13431983#comment-13431983 ]
Michael McCandless commented on LUCENE-3892:
--------------------------------------------

OK indeed PFOR is slower for me too:

{noformat}
Task              QPS base StdDev base QPS pfor StdDev pfor     Pct diff
HighPhrase            1.56        0.03     1.25        0.12  -28% - -10%
MedPhrase            13.05        0.10    10.50        0.58  -24% - -14%
LowPhrase            21.08        0.08    17.35        0.85  -22% - -13%
AndHighMed           73.78        0.66    62.50        1.68  -18% - -12%
AndHighLow          674.60        2.54   573.00       12.06  -17% - -12%
LowSpanNear           8.04        0.17     6.97        0.23  -17% -  -8%
MedSpanNear           3.97        0.10     3.58        0.15  -15% -  -3%
MedSloppyPhrase       7.58        0.11     6.93        0.14  -11% -  -5%
AndHighHigh          25.71        0.47    23.58        0.61  -12% -  -4%
HighSpanNear          1.42        0.04     1.31        0.05  -12% -  -1%
MedTerm             155.44       18.75   144.46       12.33  -24% -  14%
HighTerm             30.27        4.31    28.25        2.88  -26% -  19%
LowSloppyPhrase       6.73        0.13     6.28        0.12  -10% -  -3%
OrHighHigh            9.06        0.24     8.53        0.33  -11% -   0%
OrHighLow            23.09        0.67    21.88        0.91  -11% -   1%
OrHighMed            17.71        0.51    16.79        0.67  -11% -   1%
HighSloppyPhrase      1.88        0.05     1.80        0.04   -9% -   0%
IntNRQ                9.42        0.50     9.05        0.89  -17% -  11%
Prefix3              72.67        2.42    70.42        3.61  -11% -   5%
Fuzzy1               63.71        1.07    62.34        1.55   -6% -   1%
Wildcard             45.25        0.99    44.28        1.55   -7% -   3%
PKLookup            159.04        2.13   157.17        1.90   -3% -   1%
Fuzzy2               62.51        2.28    63.40        1.65   -4% -   8%
LowTerm             400.06       57.60   407.73       52.40  -22% -  34%
Respell              56.72        3.19    59.83        2.10   -3% -  15%
{noformat}

I think we should replace Block with BlockPacked now?

> Add a useful intblock postings format (eg, FOR, PFOR, PFORDelta, Simple9/16/64, etc.)
> -------------------------------------------------------------------------------------
>
>                 Key: LUCENE-3892
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3892
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>              Labels: gsoc2012, lucene-gsoc-12
>             Fix For: 4.1
>
>         Attachments: LUCENE-3892-BlockTermScorer.patch,
> LUCENE-3892-blockFor&hardcode(base).patch,
> LUCENE-3892-blockFor&packedecoder(comp).patch,
> LUCENE-3892-blockFor-with-packedints-decoder.patch,
> LUCENE-3892-blockFor-with-packedints-decoder.patch,
> LUCENE-3892-blockFor-with-packedints.patch, LUCENE-3892-blockpfor.patch,
> LUCENE-3892-bulkVInt.patch, LUCENE-3892-direct-IntBuffer.patch,
> LUCENE-3892-for&pfor-with-javadoc.patch, LUCENE-3892-handle_open_files.patch,
> LUCENE-3892-non-specialized.patch,
> LUCENE-3892-pfor-compress-iterate-numbits.patch,
> LUCENE-3892-pfor-compress-slow-estimate.patch, LUCENE-3892_for_byte[].patch,
> LUCENE-3892_for_int[].patch, LUCENE-3892_for_unfold_method.patch,
> LUCENE-3892_pfor_unfold_method.patch, LUCENE-3892_pulsing_support.patch,
> LUCENE-3892_settings.patch, LUCENE-3892_settings.patch
>
>
> On the flex branch we explored a number of possible intblock
> encodings, but for whatever reason never brought them to completion.
> There are still a number of issues opened with patches in different
> states.
> Initial results (based on prototype) were excellent (see
> http://blog.mikemccandless.com/2010/08/lucene-performance-with-pfordelta-codec.html
> ).
> I think this would make a good GSoC project.

--
This message is automatically generated by JIRA.
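For readers checking the benchmark table above: the "Pct diff" ranges appear consistent with comparing the two QPS values one standard deviation apart in each direction and truncating to a whole percent. The sketch below rests on that assumption. The function name `pct_diff_range` is hypothetical, and the formula is inferred from the reported rows, not taken from the benchmark tool's source.

```python
import math


def pct_diff_range(qps_base, std_base, qps_comp, std_comp):
    """Return (worst, best) percent change of comp vs. base.

    Assumption (inferred from the table, not from the benchmark tool):
    worst case pairs comp minus one stddev with base plus one stddev,
    best case pairs comp plus one stddev with base minus one stddev,
    and both results are truncated toward zero.
    """
    worst = (qps_comp - std_comp) / (qps_base + std_base) - 1.0
    best = (qps_comp + std_comp) / (qps_base - std_base) - 1.0
    return math.trunc(100 * worst), math.trunc(100 * best)


if __name__ == "__main__":
    # HighPhrase row: base 1.56 +/- 0.03, pfor 1.25 +/- 0.12
    print(pct_diff_range(1.56, 0.03, 1.25, 0.12))       # (-28, -10)
    # MedTerm row: base 155.44 +/- 18.75, pfor 144.46 +/- 12.33
    print(pct_diff_range(155.44, 18.75, 144.46, 12.33))  # (-24, 14)
```

Read this way, a row is a clear regression only when both ends of its range are negative (the phrase, span and conjunction queries). Rows whose range straddles zero, such as MedTerm, HighTerm and LowTerm, are within noise given their large standard deviations.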