Hi,

I want to do a fuzzy search that compare a phrase to a field in solr. For
example:

"abc company ltda" will be compared to "abc comp", "abc corporation", "def
company ltda", "nothing to match here".

The thing is the it has to always returns documents sorted by its score.

I've found some good algorithms to do that, like StrikeAMatch[1] and
JaroWinkler.

Using the JaroWinkler with strdist() I can do exactly that. But, I rather
prefer to use the StrikeAMatch that had a patch in the lucene jira that was
never commited.

So, I contacted the author of that patch and he told me that I should use
the solr 4.0 that it has now some pretty good new fuzzy search enhancements
that made StrikeAMatch seems toys for kids.

Anyone know how can I achieve that using solr 4.0?

[1] http://www.catalysoft.com/articles/StrikeAMatch.html

Reply via email to