Levenstein Distance

2012-06-06 Thread Gau
I have a list of synoynms which is being expanded at query time. This yields a lot of results (in millions). My use-case is name search. I want to sort the results by Levenstein Distance. I know this can be done with strdist function. But sorting being inefficient and Solr function adding to its

Sorting performance

2012-06-04 Thread Gau
Here is the usecase: I am using synonym expansion at query time to get results. this is essentially a name search, so a search for Jim may be expanded at query time for James, Jung, Jimmy, etc. So ranking fields like TF, IDF, Norms do not mean anything to me. I just reset them to zero. so all the

Sorting performance

2012-06-04 Thread Gau
Here is the usecase: I am using synonym expansion at query time to get results. this is essentially a name search, so a search for Jim may be expanded at query time for James, Jung, Jimmy, etc. So ranking fields like TF, IDF, Norms do not mean anything to me. I just reset them to zero. so all the

Re: Difference between textfield and strfield

2012-06-01 Thread Gau
is there any other option to sorting. I mean, sorting can affect query performance. Is there a way to embed this into Solr and not have a toll on the system, I tried boosting the scores based on strdist, but that seems to bring in more results than expected. -- View this message in context:

Difference between textfield and strfield

2012-05-30 Thread Gau
Hi, Can anyone explain me the basic pros and cons between textfield and strfield. I am trying to use Levenstein distance on textfield, but it seems that it can only be applied on the strfield. So my question is whats the difference between the 2 and what are the radical advantages of one over

Re: Difference between textfield and strfield

2012-05-30 Thread Gau
Well the I do not have phrases for synonym expansion. So it does work well. The synonym expansion is done at query time. And since i am just searching against the first name field, tf, idf and other ranking parameters do not make sense, hence their weight has been initialized to 1. So after

Re: Difference between textfield and strfield

2012-05-30 Thread Gau
I cannot move from textfield to strfield, since I am using synonym expansion. Is there anything we can do on textfield itself -- View this message in context: http://lucene.472066.n3.nabble.com/Difference-between-textfield-and-strfield-tp3986916p3986938.html Sent from the Solr - User mailing

Relevancy ranking for synonym matches

2012-05-29 Thread Gau
I was wondering if there is any solution for this. Currently I expand my results to match the synonyms at query time. So if I entered James, I would get results for Jim, Gomes, Game etc as they would be expanded by matching the synonyms for James. But then since this is just a one word match, tf,

Re: Solr boost relevancy

2012-05-27 Thread Gau
Hi Lori, Yeah. I thought exactly of the same solution. Use a copy field and boost the relevancy of the the exact match. But my question is more broad here. For eg, if i have a synonym for James as Jim, Games, Jimmy, Jameson And if I normalize the tf, norm, etc factors to 1, on searching for

Re: Solr boost relevancy

2012-05-27 Thread Gau
Wait, I thought the fuzzy match is invoked with a ~. I am not invoking any ~ but expanding my query terms with the synonyms at query time. So from what I understarnd, when I query for James, internally, Solr would expand using synonym search to James, Jim, Games, Jameson. So I guess, the original

Solr boost relevancy

2012-05-25 Thread Gau
Consider a db of just names. Now if I use synonym expansion at query time, I get a set of results. (Background: I created a class, which resets idf, tf, .. .all to 1) since they dont matter to me anymore. What really matters is, how closely does the query match to the given name. Currently I am