Ah Thanks for clearing that up. Does anyone know how to deal with these 2 issues when using NGramFilterFactory for autocomplete?
1) hyphens - if user types "ema" or "e-ma" I want to suggest "email" 2) accents - if user types "herme" want to suggest "Hermès" Thanks. --- On Sun, 10/3/10, Ahmet Arslan <iori...@yahoo.com> wrote: > From: Ahmet Arslan <iori...@yahoo.com> > Subject: Re: NGramFilterFactory for auto-complete that matches the middle of > multi-lingual tags? > To: solr-user@lucene.apache.org > Date: Sunday, October 3, 2010, 6:26 AM > > But I thought NGramFilterFactory > would generate substrings > > that start in the "middle", hence ensuring > autocomplete > > matching in the middle. > > > > So in the case of "electric guitar", keywordtokenizer > would > > create one token - "electric guitar" > > > > NGramFilterFactory would then take that one toke > ("electric > > guitar") and generate N-grams out of it. One of the > ngrams > > would be "guit" because "guit" is a substring of > "electric > > guitar". > > > > Ups. You are correct, I am sorry. I mixed it with > *Edge*NGramFilterFActory. > > > >