Hiroaki Kawai (JIRA) a écrit :
NGramTokenFilter optimization in query phase
--------------------------------------------

                 Key: LUCENE-1229
                 URL: https://issues.apache.org/jira/browse/LUCENE-1229
             Project: Lucene - Java
          Issue Type: Improvement
          Components: contrib/*
            Reporter: Hiroaki Kawai


I found that NGramTokenFilter-ed token stream could be optimized in query.

A standard 1,2 NGramTokenFilter will generate a token stream from "abcde" as 
follows:
a ab b bc c cd d de e

When we index "abcde", we'll use all of the tokens.

But when we query, we only need:
ab cd de
I don't understand why you index something that you will not query?
Why don'y you use a  bigram?

M.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to