Re: Solr 4.0 -> Fuzzy query and Proximity query

2013-08-28 Thread Walter Underwood
Mixing fuzzy with phonetic can give bizarre matches. I worked on a search engine that did that. You really don't want to mix stemming, phonetic, and fuzzy. They are distinct transformations of the surface word that do different things. Stemming: conflate different inflections of the same word,

Re: Solr 4.0 -> Fuzzy query and Proximity query

2013-08-28 Thread Erick Erickson
No, ComplexPhraseQuery has been around for quite a while but never incorporated into the code base, it's pretty much what you need to do both fuzzy and phrase at once. But, doesn't phonetic really incorporate at least a flavor of fuzzy? Is it close enough for your needs to just do phonetic matches

Re: Solr 4.0 -> Fuzzy query and Proximity query

2013-08-28 Thread Prasi S
sry , i copied it wrong. Below is the correct analysis. Index time ST trinity services SF trinity services LCF trinity services SF trinity services SF trinity services WDF trinity services SF triniti servic PF TRNTtriniti SRFKservic HWF TRNTtriniti SRFKservic PSF TRNTtriniti SRFKservic *Query

Re: Solr 4.0 -> Fuzzy query and Proximity query

2013-08-28 Thread Prasi S
hi Erick, Yes it is correct. These results are because of stemming + phonetic matching. Below is the Index time ST trinity services SF trinity services LCF trinity services SF trinity services SF trinity services WDF trinity services Query time SF triniti servic PF TRNTtriniti SRFKservic HWF TRN

Re: Solr 4.0 -> Fuzzy query and Proximity query

2013-08-28 Thread Erick Erickson
The first thing I'd recommend is to look at the admin/analysis page. I suspect you aren't seeing fuzzy query results at all, what you're seeing is the result of stemming. Stemming is algorithmic, so sometimes produces very surprising results, i.e. Trinidad and Trinigee may stem to something like t

Solr 4.0 -> Fuzzy query and Proximity query

2013-08-28 Thread Prasi S
Hi, with solr 4.0 the fuzzy query syntax is like ~1 (or 2) Proximity search is like "value"~20. How does this differentiate between the two searches. My thought was promiximity would be on phrases and fuzzy on individual words. Is that correct? I wasnted to do a promiximity search for text field