Thanks Mike. I'm not sure this _should_ be fixed mind you, but thought I'd ask.
On Thu, Sep 22, 2016 at 10:16 AM, Michael McCandless
wrote:
> You could index the prefix terms (edge ngrams), assuming your queries
> are prefix queries; this way there would typically be
You could index the prefix terms (edge ngrams), assuming your queries
are prefix queries; this way there would typically be far fewer terms
to visit than all 200 M terms.
Auto-prefix terms also tried to solves this more "automatically", so
you don't have to mess with edge ngrams, but we reverted
In MultiTermConstantScoreWrapper there's this block around line 174 in 6x:
do {
docs = termsEnum.postings(docs, PostingsEnum.NONE);
builder.add(docs);
} while (termsEnum.next() != null);
In the case of lots and lots of terms in a multiValued field this can
take quite a bit of time. In my