Hiroaki Kawai (JIRA) a écrit :
NGramTokenFilter optimization in query phase
--------------------------------------------
Key: LUCENE-1229
URL: https://issues.apache.org/jira/browse/LUCENE-1229
Project: Lucene - Java
Issue Type: Improvement
Components: contrib/*
Reporter: Hiroaki Kawai
I found that NGramTokenFilter-ed token stream could be optimized in query.
A standard 1,2 NGramTokenFilter will generate a token stream from "abcde" as
follows:
a ab b bc c cd d de e
When we index "abcde", we'll use all of the tokens.
But when we query, we only need:
ab cd de
I don't understand why you index something that you will not query?
Why don'y you use a bigram?
M.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]