Re: Is there Downside to a huge synonyms file?

2009-06-04 Thread Yonik Seeley
On Tue, Jun 2, 2009 at 11:28 PM, anuvenk anuvenkat...@hotmail.com wrote: I'm using query time synonyms. These don't currently work if the synonyms expand to more than one option, and those options have a different number of words. -Yonik http://www.lucidimagination.com

Re: Is there Downside to a huge synonyms file?

2009-06-03 Thread anuvenk
I tried adding some city to state mappings in the synonyms file. I'm using the dismax handler for phrase matching. So as when i add more more city to state mappings, I end up with zero results for state based searches. Eg: ca,california,los angeles ca,california,san diego

Re: Is there Downside to a huge synonyms file?

2009-06-03 Thread anuvenk
A small addition to my earlier post. I wonder if its because of the 'mm' param, which requires that until 3 words in search phrase, all the words should be matched. If i alter this now, i'd get ir-relevant results for a lot of popular 1, 2, 3 word search terms. How to solve for this? anuvenk

Re: Is there Downside to a huge synonyms file?

2009-06-02 Thread Otis Gospodnetic
Hi, If index-time synonym expansion/indexing is used, then a large synonym file means your index is going to be bigger. If query-time synonym expansion is used, then your queries are going to be larger (i.e. more ORs, thus a bit slower). How much, it really depends on your specific synonyms,

Re: Is there Downside to a huge synonyms file?

2009-06-02 Thread anuvenk
I'm using query time synonyms. I have more fields in my index though. This is just an example or sample of data from my index. Yes, we don't have millions of documents. Could be around 300,000 and might increase in future. The reason i'm using query time synonyms is because of the nature of my

Re: Is there Downside to a huge synonyms file?

2009-06-02 Thread Otis Gospodnetic
Hello, 300K is a pretty small index. I wouldn't worry about the number of synonyms unless you are turning a single term into dozens of ORed terms. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: anuvenk anuvenkat...@hotmail.com To: