I've been thinking about the CommonGramsFilter for a while, and am confused 
about how it works. Can anyone provide examples?  Are you meant to include the 
analyzer at both index and query time?  The description on the wiki says among 
other things: "The CommonGramsQueryFilter converts the phrase query "the cat" 
into the single term query the_cat." -- does that mean it _only_ works on 
phrase queries?    If you've indexed with commongrams, what will happen at 
query time to a non-phrase query <<the cat>> ?   Very confused. 
________________________________________
From: Steven A Rowe [sar...@syr.edu]
Sent: Thursday, September 23, 2010 8:21 AM
To: solr-user@lucene.apache.org
Subject: RE: bi-grams for common terms - any analyzers do that?

<http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.CommonGramsFilterFactory>


> -----Original Message-----
> From: Andy [mailto:angelf...@yahoo.com]
> Sent: Thursday, September 23, 2010 6:05 AM
> To: solr-user@lucene.apache.org
> Subject: bi-grams for common terms - any analyzers do that?
>
> Hi,
>
> I was going thru this LucidImagnaton presentation on analysis:
>
> http://www.slideshare.net/LucidImagination/analyze-this-tips-and-tricks-
> on-getting-the-lucene-solr-analyzer-to-index-and-search-your-content-right
>
> 1) on p.31-33, it talks about forming bi-grams for the 32 most common
> terms during indexing. Is there an analyzer that does that?
>
> 2) on p. 34, it mentions that the default Solr configuraton would turn
> "L'art" into the phrase query "L art" but it is much more efficient to
> turn it into a single token 'L art'. Which analyzer would do that?
>
> Thanks.
> Andy
>
>
>

Reply via email to