Re: Fuzzy searching, tildes and solr

2007-01-26 Thread Yonik Seeley
On 1/26/07, Walter Lewis <[EMAIL PROTECTED]> wrote: Yonik Seeley wrote: +(+text:jame +text:sutherland) +searchSet:testSet +(+text:james~0.75 +text:sutherland~0.75) +searchSet:testSet I can tell from the first that this is a stemmed field... "james" is transformed to "jame" "James" bein

Re: Fuzzy searching, tildes and solr

2007-01-26 Thread Walter Lewis
Yonik Seeley wrote: +(+text:jame +text:sutherland) +searchSet:testSet +(+text:james~0.75 +text:sutherland~0.75) +searchSet:testSet I can tell from the first that this is a stemmed field... "james" is transformed to "jame" "James" being the plural of "Jame" according to the stemmer. I guess my
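
The mismatch between the stemmed query and the fuzzy query can be illustrated with a small sketch. It rests on two assumptions that the thread itself does not confirm: that the fuzzy score is computed as 1 minus the edit distance divided by the length of the shorter term, and that the stemmer behaves like the hypothetical `toy_stem` below (lowercase, drop one trailing "s"). The function names are made up for illustration and are not Lucene or Solr APIs.

```python
def edit_distance(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def fuzzy_similarity(query_term: str, indexed_term: str) -> float:
    # Assumed formula: 1 - distance / length of the shorter term.
    shorter = min(len(query_term), len(indexed_term))
    if shorter == 0:
        return 0.0
    return 1.0 - edit_distance(query_term, indexed_term) / shorter


def toy_stem(term: str) -> str:
    # Hypothetical stand-in for the real stemmer:
    # lowercase the term and drop a single trailing "s".
    term = term.lower()
    return term[:-1] if term.endswith("s") and len(term) > 1 else term


# The index holds the stemmed form; the fuzzy term is compared unstemmed.
indexed = toy_stem("James")
score = fuzzy_similarity("james", indexed)
print(indexed, score)
```

Under these assumptions "james" scores exactly 0.75 against the indexed term "jame", which sits right on the requested 0.75 threshold, so whether it counts as a match would depend on how the threshold comparison is done.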

Re: Fuzzy searching, tildes and solr

2007-01-25 Thread Yonik Seeley
On 1/25/07, Walter Lewis <[EMAIL PROTECTED]> wrote: I ran the debug against the two following queries: q=(James Sutherland) returns 13 q=(James~0.75 Sutherland~0.75) returns 1 OK, I have an idea of what's going on... here are your two parsed queries side by side: +(+text:jame +text:sut

Re: Fuzzy searching, tildes and solr

2007-01-25 Thread Walter Lewis
Yonik Seeley wrote: On 1/23/07, Walter Lewis <[EMAIL PROTECTED]> wrote: This is quite possibly a Lucene question rather than a Solr one, so my apologies if you think it's out of scope. Underlying the Solr search are some very useful Lucene constructs. One of the most powerful, imho, is the til

Re: Fuzzy searching, tildes and solr

2007-01-23 Thread Yonik Seeley
On 1/23/07, Walter Lewis <[EMAIL PROTECTED]> wrote: This is quite possibly a Lucene question rather than a Solr one, so my apologies if you think it's out of scope. Underlying the Solr search are some very useful Lucene constructs. One of the most powerful, imho, is the tilde number combination

Fuzzy searching, tildes and solr

2007-01-23 Thread Walter Lewis
This is quite possibly a Lucene question rather than a Solr one, so my apologies if you think it's out of scope. Underlying the Solr search are some very useful Lucene constructs. One of the most powerful, imho, is the tilde number combination for a "fuzzy" search. In one of my data sets
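
For readers unfamiliar with the tilde number combination, here is a minimal sketch of how a plain query could be turned into the fuzzy form quoted elsewhere in this thread (`James~0.75 Sutherland~0.75`). The helper name `fuzzify` and its default threshold are hypothetical and not part of Solr or Lucene; the sketch only builds the query string and does not escape special characters.

```python
def fuzzify(query: str, min_similarity: float = 0.75) -> str:
    """Append ~<min_similarity> to every whitespace-separated term."""
    terms = query.split()
    return " ".join(f"{term}~{min_similarity}" for term in terms)


# Build the fuzzy variant of a two-word name query.
print(fuzzify("James Sutherland"))
```

The resulting string would then be passed as the `q` parameter in place of the plain query.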