[ https://issues.apache.org/jira/browse/LUCENE-3892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13287936#comment-13287936 ]
Michael McCandless commented on LUCENE-3892: -------------------------------------------- Awesome progress! Nice to have a dirt path online that we can then iterate from ... Hmm, I'm seeing some test failures when I run: {noformat} ant test -Dtests.postingsformat=PFor {noformat} Eg, TestNRTThreads, TestShardSearching, TestTimeLimitingCollector. Remember to add the standard copyright headers to each new source file... We don't have to do this now, but I wonder if we can share code w/ the packed ints impl we have, instead generating another one with the .py source. TestDemo makes a nice TestMin... I usually start with TestDemo when testing scary new code, and then it's a huge milestone once TestDemo passes :) We should definitely cutover to BlockTree terms dict (I would upgrade that TODO to a nocommit!). I suspect that wrapping the blocks byte[] as ByteBuffer and then IntBuffer is going to be too costly per decode so we should init them once and re-use (upgrade that TODO to a nocommit). > Add a useful intblock postings format (eg, FOR, PFOR, PFORDelta, > Simple9/16/64, etc.) > ------------------------------------------------------------------------------------- > > Key: LUCENE-3892 > URL: https://issues.apache.org/jira/browse/LUCENE-3892 > Project: Lucene - Java > Issue Type: Improvement > Reporter: Michael McCandless > Labels: gsoc2012, lucene-gsoc-12 > Fix For: 4.1 > > Attachments: LUCENE-3892_pfor.patch, LUCENE-3892_settings.patch, > LUCENE-3892_settings.patch > > > On the flex branch we explored a number of possible intblock > encodings, but for whatever reason never brought them to completion. > There are still a number of issues opened with patches in different > states. > Initial results (based on prototype) were excellent (see > http://blog.mikemccandless.com/2010/08/lucene-performance-with-pfordelta-codec.html > ). > I think this would make a good GSoC project. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org