I set this up a few years ago with something like the following:

<fieldType name="autocomplete" class="solr.TextField">
  <analyzer type="index">
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory" />
    <filter class="solr.PatternReplaceFilterFactory" pattern="([^a-z0-9])" replacement="" replace="all" />
    <filter class="solr.EdgeNGramFilterFactory" maxGramSize="20" minGramSize="1" />
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory" />
    <filter class="solr.PatternReplaceFilterFactory" pattern="([^a-z0-9])" replacement="" replace="all" />
  </analyzer>
</fieldType>
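To make the effect of that analyzer chain concrete, here is a rough Python simulation of what the index and query sides would produce. It is only a sketch under assumptions: the function names (normalize, edge_ngrams, index_terms, query_term, matches) are made up for illustration and are not part of Solr, and it assumes the whole field value arrives as a single token, as KeywordTokenizerFactory does.

```python
import re


def normalize(text):
    """Mimic LowerCaseFilter followed by the PatternReplaceFilter:
    lowercase, then drop every character that is not a-z or 0-9."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def edge_ngrams(term, min_gram=1, max_gram=20):
    """Mimic the edge n-gram step: all prefixes of the term from
    min_gram up to max_gram characters (or the term length, if shorter)."""
    upper = min(len(term), max_gram)
    return [term[:n] for n in range(min_gram, upper + 1)]


def index_terms(value):
    """Terms the index-time analyzer would store for one field value."""
    return set(edge_ngrams(normalize(value)))


def query_term(user_input):
    """Single term the query-time analyzer would produce (no n-grams)."""
    return normalize(user_input)


def matches(user_input, indexed_value):
    """True if the normalized query equals one of the indexed prefixes."""
    return query_term(user_input) in index_terms(indexed_value)


if __name__ == "__main__":
    print(sorted(index_terms("New York"), key=len))
    print(matches("new y", "New York"))
    print(matches("st joh", "St. John's"))
```

Because spaces and punctuation are stripped on both sides, a multi-word input such as "new y" becomes "newy" and lines up with a prefix of "newyork". Note that in this sketch the input has to be a prefix of the whole normalized phrase, so a partial first word followed by a second word (for example "app mou" against "apple mouse") would still not match.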
<filter class="solr.PatternReplaceFilterFactory" pattern="([^a-z0-9])" replacement="" replace="all" />

is the bit that is missing here, I think.

This way the search is agnostic to case and to any non-alphanumeric characters. This was to facilitate a location autocomplete for searching. So it was a basic search, returning the top N results, along with additional info to show in the autocomplete, to our mod_perl servers. Results were cached in the mod_perl servers.

Regards,
Dan

On Thu, Sep 2, 2010 at 1:53 PM, Jason Rutherglen <jason.rutherg...@gmail.com> wrote:

> I'm having a different issue with the EdgeNGram technique described
> here:
> http://www.lucidimagination.com/blog/2009/09/08/auto-suggest-from-popular-queries-using-edgengrams/
>
> That is one word queries q=app on the query_text field, work fine
> however "q=app mou" do not. Why would this be or is there a
> configuration that could be missing?
>
> On Wed, Sep 1, 2010 at 3:53 PM, Eric Grobler <impalah...@googlemail.com> wrote:
>
> > Thanks for your feedback Robert,
> >
> > I will try that and see how Solr performs on my data - I think I will
> > create a field that contains only important key/product terms from the text.
> >
> > Regards
> > Johan
> >
> > On Wed, Sep 1, 2010 at 9:12 PM, Robert Petersen <rober...@buy.com> wrote:
> >
> >> We don't have that many, just a hundred thousand, and solr response
> >> times (since the index's docs are small and not complex) are logged as
> >> typically 1 ms if not 0 ms. It's funny but sometimes it is so fast no
> >> milliseconds have elapsed. Incredible if you ask me... :)
> >>
> >> Once you get SOLR to consider the whole phrase as just one big term, the
> >> wildcard is very fast.
> >>
> >> -----Original Message-----
> >> From: Eric Grobler [mailto:impalah...@googlemail.com]
> >> Sent: Wednesday, September 01, 2010 12:35 PM
> >> To: solr-user@lucene.apache.org
> >> Subject: Re: Auto Suggest
> >>
> >> Hi Robert,
> >>
> >> Interesting approach, how many documents do you have in Solr?
> >> I have about 2 million and I just wonder if it might be a bit slow.
> >>
> >> Regards
> >> Johan
> >>
> >> On Wed, Sep 1, 2010 at 7:38 PM, Robert Petersen <rober...@buy.com> wrote:
> >>
> >> > I do this by replacing the spaces with a '%' in a separate search field
> >> > which is not parsed nor tokenized and then you can wildcard across the
> >> > whole phrase like you want and the spaces don't mess you up. Just store
> >> > the original phrase with spaces in a separate field for returning to the
> >> > front end for display.
> >> >
> >> > -----Original Message-----
> >> > From: Jazz Globe [mailto:jazzgl...@hotmail.com]
> >> > Sent: Wednesday, September 01, 2010 7:33 AM
> >> > To: solr-user@lucene.apache.org
> >> > Subject: Auto Suggest
> >> >
> >> > Hallo
> >> >
> >> > How would one implement a multiple term auto-suggest feature in Solr
> >> > that is filter sensitive?
> >> > For example, a user enters :
> >> > "mp3"
> >> > and solr might suggest:
> >> > -> "mp3 player"
> >> > -> "mp3 nano"
> >> > -> "mp3 sony"
> >> > and then the user starts the second word :
> >> > "mp3 n"
> >> > and that narrows it down to:
> >> > -> "mp3 nano"
> >> >
> >> > I had a quick look at the Terms Component.
> >> > I suppose it just returns term totals for the entire index and cannot be
> >> > used with a filter or query?
> >> >
> >> > Thanks
> >> > Johan