Cache Used by IndexReader/IndexSearcher

2008-12-16 Thread Sangrish
Hi All, I have a 50 GB index of about 40 million documents. I need to query it around 40,000 times (different queries), one by one. I saw that the query times are negligible for the first, say, 25,000 queries, but they degrade later on. For example, the time for 200 sequential queries chang

Re: Arbitrary String to String Similarity Score

2008-06-20 Thread Sangrish
ar/MoreLikeThis.java > > > I think that or something a lot like it is what you are after. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > - Original Message >> From: Sangrish <[EMAIL PROTECTED]> >> To: java

Re: Arbitrary String to String Similarity Score

2008-06-19 Thread Sangrish
you have str1 and str2, so why do you need to inject > an index into the middle of it? > > -Grant > > On Jun 19, 2008, at 8:33 PM, Sangrish wrote: > >> >> I have a use case for comparing two given strings (attached to a >> specific >> field) >>

Arbitrary String to String Similarity Score

2008-06-19 Thread Sangrish
I have a use case for comparing two given strings (attached to a specific field) using Lucene and getting the similarity scores. I tried but could not find any built-in way to do so. Hence, assuming that Lucene only compares a Query against indexed documents, I came up with the following approach: (