I've been thinking about the CommonGramsFilter for a while, and am confused about how it works. Can anyone provide examples? Are you meant to include the analyzer at both index and query time? The description on the wiki says among other things: "The CommonGramsQueryFilter converts the phrase query "the cat" into the single term query the_cat." -- does that mean it _only_ works on phrase queries? If you've indexed with commongrams, what will happen at query time to a non-phrase query <<the cat>> ? Very confused. ________________________________________ From: Steven A Rowe [sar...@syr.edu] Sent: Thursday, September 23, 2010 8:21 AM To: solr-user@lucene.apache.org Subject: RE: bi-grams for common terms - any analyzers do that?
<http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.CommonGramsFilterFactory> > -----Original Message----- > From: Andy [mailto:angelf...@yahoo.com] > Sent: Thursday, September 23, 2010 6:05 AM > To: solr-user@lucene.apache.org > Subject: bi-grams for common terms - any analyzers do that? > > Hi, > > I was going thru this LucidImagnaton presentation on analysis: > > http://www.slideshare.net/LucidImagination/analyze-this-tips-and-tricks- > on-getting-the-lucene-solr-analyzer-to-index-and-search-your-content-right > > 1) on p.31-33, it talks about forming bi-grams for the 32 most common > terms during indexing. Is there an analyzer that does that? > > 2) on p. 34, it mentions that the default Solr configuraton would turn > "L'art" into the phrase query "L art" but it is much more efficient to > turn it into a single token 'L art'. Which analyzer would do that? > > Thanks. > Andy > > >