[ 
https://issues.apache.org/jira/browse/LUCENE-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

hao yan updated LUCENE-1410:
----------------------------

    Attachment: LUCENE-1410.patch

This patch is to add codec support for PForDelta compression algorithms.


Changes by Hao Yan (hyan2...@gmail.com)

In summary, I added five files to support and test the codec.

In Src,
1.      org.apache.lucene.index.codecs.pfordelta.PForDelta.java
2.      org.apache.lucene.index.codecs.pfordelta.Simple16.java
3.      org.apache.lucene.index.codecs.PForDeltaFixedBlockCodec.java
4.      
org.apache.lucene.index.codecs.intblock.FixedIntBlockIndexOutputWithGetElementNum.java

In Test,
5.      
org.apache.lucene.index.codecs.intblock.TestPForDeltaFixedIntBLockCodec.java

1)      In particular, the firs class PForDelta is the core implementation
of PForDelta algorithm, which compresses exceptions using Simple16
that is implemented in the second class Simple16.
2)      The third classs PForDeltaFixedBlockCodec is similar to
org.apache.lucene.index.codesc.ockintblock.MockFixedIntBlockCodec in
Test, except that it uses PForDelta to encode the data in the buffer.
3)      The fourth class is almost the same as
org.apache.lucene.index.codecs.intblock.FixedIntBlockINdexOuput,
except that it provides an additional public function to retrieve the
value of the upto field, which is private filed in
FixedIntBlockINdexOuput. The reason I added this public function is
that the number of elements in the block that have meaningful values is not 
always equal to the blockSize or the buffer
size since the last block/buffer of a stream of data usually only
contain less number of data. In the case, I will fill all elements after the 
meaningful elements with 0s. Thus, we alwasy compress one entire block.

4)      The last class is the unit test to test PForDeltaFixedIntBlockCodec
which is very similar to
org.apache.lucene.index.codecs.mintblock.TestIntBlockCodec.

I also changed the LuceneTestCase class to add the new
PForDeltaFixeIntBlockCOde.

The unit tests and all lucence tests have passed.


> PFOR implementation
> -------------------
>
>                 Key: LUCENE-1410
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1410
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Index
>            Reporter: Paul Elschot
>            Priority: Minor
>             Fix For: Bulk Postings branch
>
>         Attachments: autogen.tgz, for-summary.txt, 
> LUCENE-1410-codecs.tar.bz2, LUCENE-1410.patch, LUCENE-1410.patch, 
> LUCENE-1410.patch, LUCENE-1410b.patch, LUCENE-1410c.patch, 
> LUCENE-1410d.patch, LUCENE-1410e.patch, TermQueryTests.tgz, TestPFor2.java, 
> TestPFor2.java, TestPFor2.java
>
>   Original Estimate: 21840h
>  Remaining Estimate: 21840h
>
> Implementation of Patched Frame of Reference.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to