[ https://issues.apache.org/jira/browse/LUCENE-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
hao yan updated LUCENE-1410:
----------------------------

    Attachment: LUCENE-1410.patch

This patch adds codec support for the PForDelta compression algorithm.

Changes by Hao Yan (hyan2...@gmail.com)

In summary, I added five files to support and test the codec.

In Src,
1. org.apache.lucene.index.codecs.pfordelta.PForDelta.java
2. org.apache.lucene.index.codecs.pfordelta.Simple16.java
3. org.apache.lucene.index.codecs.PForDeltaFixedBlockCodec.java
4. org.apache.lucene.index.codecs.intblock.FixedIntBlockIndexOutputWithGetElementNum.java

In Test,
5. org.apache.lucene.index.codecs.intblock.TestPForDeltaFixedIntBlockCodec.java

1) The first class, PForDelta, is the core implementation of the PForDelta algorithm. It compresses exceptions using Simple16, which is implemented in the second class, Simple16.

2) The third class, PForDeltaFixedBlockCodec, is similar to org.apache.lucene.index.codecs.mockintblock.MockFixedIntBlockCodec in Test, except that it uses PForDelta to encode the data in the buffer.

3) The fourth class is almost the same as org.apache.lucene.index.codecs.intblock.FixedIntBlockIndexOutput, except that it provides an additional public function to retrieve the value of the upto field, which is a private field in FixedIntBlockIndexOutput. I added this public function because the number of elements in a block that hold meaningful values is not always equal to the blockSize (the buffer size): the last block/buffer of a stream of data usually holds fewer values than a full block. In that case, I fill all elements after the meaningful ones with 0s, so that we always compress one entire block.

4) The last class is the unit test for PForDeltaFixedIntBlockCodec, which is very similar to org.apache.lucene.index.codecs.intblock.TestIntBlockCodec.

I also changed the LuceneTestCase class to add the new PForDeltaFixedIntBlockCodec.

The unit tests and all Lucene tests have passed.
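The zero-padding of the last block described in point 3) can be illustrated with a minimal sketch. This is not code from the patch: the class name BlockPaddingSketch and the helper padBlock are hypothetical and exist only for this example, while upto and the fixed block size follow the description above. The sketch assumes the buffer is a plain int[] whose length equals the block size.

```java
import java.util.Arrays;

public class BlockPaddingSketch {

    /**
     * Hypothetical helper (not part of the patch): zero-fills every slot
     * from index upto to the end of the buffer, so that a complete block
     * can always be handed to the encoder.
     *
     * @param buffer fixed-size block buffer; its length is the block size
     * @param upto   number of meaningful values at the start of the buffer
     */
    static void padBlock(int[] buffer, int upto) {
        if (upto < 0 || upto > buffer.length) {
            throw new IllegalArgumentException("upto out of range: " + upto);
        }
        // Slots past upto may still hold stale values from the previous
        // block, so they are overwritten with 0 before compression.
        Arrays.fill(buffer, upto, buffer.length, 0);
    }

    public static void main(String[] args) {
        // Last block of a stream: only 5 of the 8 slots are meaningful,
        // the remaining 3 still contain leftovers (99) from an earlier block.
        int[] buffer = {3, 7, 1, 9, 4, 99, 99, 99};
        int upto = 5;

        padBlock(buffer, upto);

        System.out.println(Arrays.toString(buffer));
    }
}
```

Padding with 0 keeps the padded slots as cheap as possible to encode, and a reader that knows how many values the stream contains can simply ignore the trailing zeros of the final block.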
> PFOR implementation
> -------------------
>
>                 Key: LUCENE-1410
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1410
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Index
>            Reporter: Paul Elschot
>            Priority: Minor
>             Fix For: Bulk Postings branch
>
>         Attachments: autogen.tgz, for-summary.txt, LUCENE-1410-codecs.tar.bz2, LUCENE-1410.patch, LUCENE-1410.patch, LUCENE-1410.patch, LUCENE-1410b.patch, LUCENE-1410c.patch, LUCENE-1410d.patch, LUCENE-1410e.patch, TermQueryTests.tgz, TestPFor2.java, TestPFor2.java, TestPFor2.java
>
>   Original Estimate: 21840h
>  Remaining Estimate: 21840h
>
> Implementation of Patched Frame of Reference.