[
https://issues.apache.org/jira/browse/LUCENE-5293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13798499#comment-13798499
]
Paul Elschot commented on LUCENE-5293:
--------------------------------------
bq. relax the Elias-Fano doc id set building process so that it could be
built by providing an approximation of the number of docs in the set (at the
cost of some compression loss)
The approximation would have to be a lower bound, i.e. it might be higher than
the actual number of documents.
The EliasFanoEncoder reserves all the memory it needs at construction time, so
the loss in compression would be roughly as noticable as the accuracy of the
bound.
DocIdSetIterator.cost has another purpose, so it's not worthwhile to use it
here I think.
Does the faster build time of the EF DocIdSet (compared to WAH8 and FBS) allow
for an extra FBS to be built?
That is not immediately clear from the benchmark results, but it could be so.
> Also use EliasFanoDocIdSet in CachingWrapperFilter
> --------------------------------------------------
>
> Key: LUCENE-5293
> URL: https://issues.apache.org/jira/browse/LUCENE-5293
> Project: Lucene - Core
> Issue Type: Improvement
> Components: core/search
> Reporter: Paul Elschot
> Priority: Minor
> Attachments: LUCENE-5293.patch
>
>
--
This message was sent by Atlassian JIRA
(v6.1#6144)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]