Re: Sorting posting lists before intersection

Renaud Delbru Mon, 13 Oct 2008 08:52:55 -0700

Hi,

Paul Elschot wrote:

This could be done, but since not all scorers will be TermScorers it
will be necessary to add a method to Scorer (or perhaps even to its
DocIdSetIterator superclass):


   public abstract int estimatedDocFreq();

and implement this for all existing instances. TermScorer could
implement it without estimating.
For AND/OR/NOT such an estimation is straightforward but for
proximity queries it would be more of a guess.

I agree. Indeed, for proximity queries, it is more tricky. Maybe takingthe frequency of the rarest term in a PhraseQuery / SpanQuery could be anot so bad predictor in general.


Regards.
--
Renaud Delbru

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Sorting posting lists before intersection

Reply via email to