[ 
https://issues.apache.org/jira/browse/LUCENE-5293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13798499#comment-13798499
 ] 

Paul Elschot commented on LUCENE-5293:
--------------------------------------

bq.   relax the Elias-Fano doc id set building process so that it could be 
built by providing an approximation of the number of docs in the set (at the 
cost of some compression loss)

The approximation would have to be a lower bound, i.e. it might be higher than 
the actual number of documents.
The EliasFanoEncoder reserves all the memory it needs at construction time, so 
the loss in compression would be roughly as noticable as the accuracy of the 
bound.
DocIdSetIterator.cost has another purpose, so it's not worthwhile to use it 
here I think.

Does the faster build time of the EF DocIdSet (compared to WAH8 and FBS) allow 
for an extra FBS to be built?
That is not immediately clear from the benchmark results, but it could be so.


> Also use EliasFanoDocIdSet in CachingWrapperFilter
> --------------------------------------------------
>
>                 Key: LUCENE-5293
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5293
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/search
>            Reporter: Paul Elschot
>            Priority: Minor
>         Attachments: LUCENE-5293.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to