[ https://issues.apache.org/jira/browse/LUCENE-5460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13907044#comment-13907044 ]
Robert Muir commented on LUCENE-5460:
-------------------------------------
Not that TODO, and I would remove that TODO myself.
That TODO is about whether to pass the filter down as Bits, and it's based on
sparsity.
The problem here is the two strategies (LEAP_FROG_FILTER_FIRST_STRATEGY,
LEAP_FROG_QUERY_FIRST_STRATEGY). We should provide a way to choose between
them based on cost(). Perhaps it could simply be AUTO_STRATEGY or something
like that (similar to MultiTermQuery), still giving explicit control if users
want to bypass the heuristics.
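
To make the AUTO_STRATEGY idea concrete, here is a minimal sketch of a
cost-based choice. It is only an illustration: AutoStrategySketch, Strategy,
and choose() are hypothetical names, not Lucene API, and the rule "lead with
whichever side reports the lower cost()" is an assumed heuristic, not
something settled on this issue.

```java
// Hypothetical sketch. Only the two strategy names come from the comment
// above; the class, enum, and method are made up for illustration.
final class AutoStrategySketch {

    enum Strategy {
        LEAP_FROG_FILTER_FIRST_STRATEGY,
        LEAP_FROG_QUERY_FIRST_STRATEGY
    }

    /**
     * Picks which side should lead the leap-frog.
     * Assumption: both arguments are cost() estimates (roughly, an upper
     * bound on matching docs), and the cheaper side should drive.
     */
    static Strategy choose(long filterCost, long queryCost) {
        return filterCost <= queryCost
            ? Strategy.LEAP_FROG_FILTER_FIRST_STRATEGY
            : Strategy.LEAP_FROG_QUERY_FIRST_STRATEGY;
    }
}
```

Users who want to bypass the heuristic would still pass one of the two
explicit strategies instead of calling something like choose().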
> Allow driving a query by sparse filters
> ---------------------------------------
>
> Key: LUCENE-5460
> URL: https://issues.apache.org/jira/browse/LUCENE-5460
> Project: Lucene - Core
> Issue Type: Improvement
> Components: core/search
> Reporter: Shai Erera
>
> Today, if a filter is very sparse, we execute the query in a sort of
> leap-frog manner between the query and filter. If the query is very
> expensive to compute and/or matches only a few docs, calling
> scorer.advance(doc) just to discover that the doc it landed on isn't
> accepted by the filter is a waste of time. Since Filter is always the
> "final ruler", I wonder whether, if we had something like
> {{boolean DISI.advanceExact(doc)}}, we could use it instead in some cases.
> There are many combinations in which I think we'd want to use/not-use this
> API, and they depend on: Filter's complexity, Filter.cost(), Scorer.cost(),
> query complexity (span-near, many clauses) etc.
> I'm opening this issue so we can discuss. DISI.advanceExact(doc) is just a
> preliminary proposal, to get an API we could experiment with. The default
> implementation should be fairly easy and straightforward, and we could
> override it where we can offer a more optimized implementation.
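
For reference, one way the default advanceExact(doc) from the quoted
description might look, sketched on a toy iterator over a sorted int array.
SortedDocs and AdvanceExactSketch are hypothetical stand-ins, not Lucene
classes, and the default shown simply falls back to advance(), so it saves
nothing by itself; the benefit would come from optimized overrides.

```java
// Hypothetical sketch of the proposed advanceExact; not Lucene code.
final class AdvanceExactSketch {

    static final int NO_MORE_DOCS = Integer.MAX_VALUE;

    /** Toy doc-id iterator over a sorted array of doc ids. */
    static class SortedDocs {
        private final int[] docs;
        private int idx = -1; // -1 means "not positioned yet"

        SortedDocs(int... docs) {
            this.docs = docs;
        }

        int docID() {
            if (idx < 0) return -1;
            return idx < docs.length ? docs[idx] : NO_MORE_DOCS;
        }

        /** Moves to the first doc >= target, beyond the current position. */
        int advance(int target) {
            do {
                idx++;
            } while (idx < docs.length && docs[idx] < target);
            return docID();
        }

        /**
         * Assumed default: advance only if still behind target, then
         * report whether the iterator sits exactly on target.
         */
        boolean advanceExact(int target) {
            int doc = docID();
            if (doc < target) {
                doc = advance(target);
            }
            return doc == target;
        }
    }
}
```

With docs {3, 7, 20}, advanceExact(7) returns true, and advanceExact(10)
returns false while leaving the iterator on 20, which is exactly the wasted
work an optimized override would try to avoid.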