[ 
https://issues.apache.org/jira/browse/LUCENE-5084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13697584#comment-13697584
 ] 

Paul Elschot edited comment on LUCENE-5084 at 7/2/13 7:51 AM:
--------------------------------------------------------------

bq. Have you considered creating a PostingFormat with this? I was thinking in 
something like DirectPostingsFormat but instead of using an array of ints for 
storing the docIds using an Elias-Fano compressed bit stream.

The Vigna paper is all about postings formats.
For that reason I first implemented an encoder and a decoder that operate on 
long values, and then used them here for a DocIdSet that works on int doc IDs.

For a postings format, the encoder would need an additional constructor that 
builds it from index data. That might involve merging the currently separate 
long arrays for the high bits and the low bits into a single array.
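
To make the high bits / low bits split concrete, here is a minimal sketch of 
Elias-Fano encoding over long values. It is not the code from the attached 
patch: the class name EliasFanoSketch, its fields and its helper methods are 
hypothetical, and it assumes a non-empty, non-decreasing input with a known 
upper bound.

```java
/**
 * Hypothetical sketch of Elias-Fano encoding with separate long arrays for
 * the low bits and the high bits. Not the encoder from the LUCENE-5084 patch.
 * Assumes a non-empty, non-decreasing input with values in [0, upperBound],
 * and array sizes that fit in an int.
 */
final class EliasFanoSketch {
    final int numValues;
    final int numLowBits;   // L = floor(log2(upperBound / numValues)), or 0
    final long[] lowBits;   // the lower L bits of each value, packed back to back
    final long[] highBits;  // bit (value >>> L) + i is set for the i-th value

    EliasFanoSketch(long[] sortedValues, long upperBound) {
        if (sortedValues.length == 0) {
            throw new IllegalArgumentException("empty input");
        }
        numValues = sortedValues.length;
        long ratio = upperBound / numValues;
        numLowBits = ratio < 1 ? 0 : 63 - Long.numberOfLeadingZeros(ratio);
        lowBits = new long[(int) (((long) numValues * numLowBits + 63) >>> 6)];
        long numHighBits = numValues + (upperBound >>> numLowBits) + 1;
        highBits = new long[(int) ((numHighBits + 63) >>> 6)];
        for (int i = 0; i < numValues; i++) {
            long v = sortedValues[i];
            long low = v & ((1L << numLowBits) - 1);
            writeBits(lowBits, (long) i * numLowBits, low, numLowBits);
            long pos = (v >>> numLowBits) + i;  // unary code of the high part
            highBits[(int) (pos >>> 6)] |= 1L << (pos & 63);
        }
    }

    /** Decodes all values sequentially by scanning the high bits for set bits. */
    long[] decode() {
        long[] out = new long[numValues];
        long bitPos = 0;
        for (int i = 0; i < numValues; i++) {
            while ((highBits[(int) (bitPos >>> 6)] & (1L << (bitPos & 63))) == 0) {
                bitPos++;
            }
            long high = bitPos - i;  // number of zero bits seen so far
            long low = readBits(lowBits, (long) i * numLowBits, numLowBits);
            out[i] = (high << numLowBits) | low;
            bitPos++;
        }
        return out;
    }

    /** Writes the lowest numBits of value at bitPos; may span two longs. */
    private static void writeBits(long[] arr, long bitPos, long value, int numBits) {
        if (numBits == 0) return;
        int word = (int) (bitPos >>> 6);
        int shift = (int) (bitPos & 63);
        arr[word] |= value << shift;
        if (shift + numBits > 64) {
            arr[word + 1] |= value >>> (64 - shift);
        }
    }

    /** Reads numBits starting at bitPos; may span two longs. */
    private static long readBits(long[] arr, long bitPos, int numBits) {
        if (numBits == 0) return 0;
        int word = (int) (bitPos >>> 6);
        int shift = (int) (bitPos & 63);
        long v = arr[word] >>> shift;
        if (shift + numBits > 64) {
            v |= arr[word + 1] << (64 - shift);
        }
        return v & ((1L << numBits) - 1);
    }
}
```

Calling new EliasFanoSketch(values, upperBound).decode() should return the 
original values. A single-array layout, as mentioned above for a postings 
format, would place the low bits and the high bits one after the other in the 
same long[]; this sketch keeps them apart to match the current description.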
                
      was (Author: [email protected]):
    bq. Have you considered creating a PostingFormat with this? I was thinking 
in something like DirectPostingsFormat but instead of using an array of ints 
for storing the docIds using an Elias-Fano compressed bit stream.

                  
> EliasFanoDocIdSet
> -----------------
>
>                 Key: LUCENE-5084
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5084
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Paul Elschot
>            Assignee: Adrien Grand
>            Priority: Minor
>         Attachments: LUCENE-5084.patch
>
>
> DocIdSet in Elias-Fano encoding

