Re: NGramFilterFactory for auto-complete that matches the middle of multi-lingual tags?

Andy Sun, 03 Oct 2010 14:48:59 -0700

Ah, thanks for clearing that up.

Does anyone know how to deal with these 2 issues when using NGramFilterFactory 
for autocomplete?


1) hyphens - if user types "ema" or "e-ma" I want to suggest "email"

2) accents - if user types "herme" I want to suggest "Hermès"
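
To make the two cases concrete, here is a minimal Python sketch of the
behavior I am after. It is not Solr code: normalize() and suggest() are
hypothetical helpers made up for illustration, and the plain substring
test only stands in for what the n-gram matching would do. The idea is
that the typed text and the tag go through the same normalization
(lowercase, drop hyphens, strip accents) before they are compared.

```python
import unicodedata

def normalize(text):
    """Lowercase, strip accents, and drop hyphens (illustrative only)."""
    # NFKD splits an accented letter into base letter + combining mark.
    decomposed = unicodedata.normalize("NFKD", text.lower())
    without_accents = "".join(
        c for c in decomposed if not unicodedata.combining(c)
    )
    return without_accents.replace("-", "")

def suggest(typed, tags):
    """Return the tags whose normalized form contains the normalized input."""
    needle = normalize(typed)
    return [tag for tag in tags if needle in normalize(tag)]

tags = ["email", "Hermès", "electric guitar"]
print(suggest("ema", tags))    # ['email']
print(suggest("e-ma", tags))   # ['email']
print(suggest("herme", tags))  # ['Hermès']
```

The suggestion shown to the user is still the original tag, accent
included; only the comparison uses the normalized form.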

Thanks.

--- On Sun, 10/3/10, Ahmet Arslan <iori...@yahoo.com> wrote:

> From: Ahmet Arslan <iori...@yahoo.com>
> Subject: Re: NGramFilterFactory for auto-complete that matches the middle of 
> multi-lingual tags?
> To: solr-user@lucene.apache.org
> Date: Sunday, October 3, 2010, 6:26 AM
> > But I thought NGramFilterFactory would generate substrings
> > that start in the "middle", hence ensuring autocomplete
> > matching in the middle.
> > 
> > So in the case of "electric guitar", keywordtokenizer would
> > create one token - "electric guitar"
> > 
> > NGramFilterFactory would then take that one token ("electric
> > guitar") and generate N-grams out of it. One of the ngrams
> > would be "guit" because "guit" is a substring of "electric
> > guitar".
> > 
> 
> Oops. You are correct, I am sorry. I mixed it up with
> *Edge*NGramFilterFactory.
> 
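
The difference discussed in the quoted exchange can be sketched in a few
lines of Python. This is only an illustration of the idea, not the Solr
implementation: ngrams() and edge_ngrams() are hypothetical helpers, the
sizes 1 to 5 are arbitrary, and the edge variant here is assumed to emit
prefixes only.

```python
def ngrams(token, min_size, max_size):
    """All substrings of token with length min_size..max_size."""
    grams = []
    for size in range(min_size, max_size + 1):
        for start in range(len(token) - size + 1):
            grams.append(token[start:start + size])
    return grams

def edge_ngrams(token, min_size, max_size):
    """Only the prefixes of token with length min_size..max_size."""
    longest = min(max_size, len(token))
    return [token[:size] for size in range(min_size, longest + 1)]

# One token, as a keyword tokenizer would produce for the whole tag.
token = "electric guitar"

print("guit" in ngrams(token, 1, 5))       # True: matches in the middle
print("guit" in edge_ngrams(token, 1, 5))  # False: prefixes only
print("elec" in edge_ngrams(token, 1, 5))  # True
```

So plain n-grams give matching anywhere in the tag, at the cost of many
more indexed terms than the edge variant.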
