Re: [jira] Created: (LUCENE-1229) NGramTokenFilter optimization in query phase

Mathieu Lecarme Fri, 14 Mar 2008 02:08:49 -0700

Hiroaki Kawai (JIRA) a écrit :

NGramTokenFilter optimization in query phase
--------------------------------------------


                 Key: LUCENE-1229
                 URL: https://issues.apache.org/jira/browse/LUCENE-1229
             Project: Lucene - Java
          Issue Type: Improvement
          Components: contrib/*
            Reporter: Hiroaki Kawai


I found that NGramTokenFilter-ed token stream could be optimized in query.

A standard 1,2 NGramTokenFilter will generate a token stream from "abcde" as 
follows:
a ab b bc c cd d de e

When we index "abcde", we'll use all of the tokens.

But when we query, we only need:
ab cd de

I don't understand why you index something that you will not query?
Why don'y you use a  bigram?

M.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [jira] Created: (LUCENE-1229) NGramTokenFilter optimization in query phase

Reply via email to