document retrieval 100 times slower after finishing some heavy disk operation

2008-06-28 Thread qaz zaq
Hi, I ran into a very strange situation regarding slow document retrieval and urgently need some advice. I have 2 FSDirectory indexes, each about 500M in size. I have 2 parallel search threads fetching 200 documents from these 2 indexes, which usually takes less than 16 ms.

Duplicates removal in search results

2006-12-14 Thread qaz zaq
How can I remove duplicate records from the search results? That is, I have multiple results with the same title in the 'title' field, and I want only 1 record per title. How can I achieve that? Thanks!!
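
One way to approach this, independent of any particular Lucene API, is to post-filter the ranked hits and keep only the first hit for each title. The sketch below is a minimal illustration under that assumption: `Hit` and `dedupeByTitle` are hypothetical names, not Lucene classes, and the input list is assumed to be sorted by descending score already.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper; Hit is a stand-in for a search result, not a Lucene class.
class TitleDedup {
    static final class Hit {
        final String title;
        final float score;

        Hit(String title, float score) {
            this.title = title;
            this.score = score;
        }
    }

    // Keeps the first hit seen for each title and preserves the input order.
    // If the input is sorted by descending score, the survivor for each
    // title is its best-scoring hit.
    static List<Hit> dedupeByTitle(List<Hit> rankedHits) {
        Map<String, Hit> firstPerTitle = new LinkedHashMap<>();
        for (Hit hit : rankedHits) {
            firstPerTitle.putIfAbsent(hit.title, hit);
        }
        return new ArrayList<>(firstPerTitle.values());
    }

    public static void main(String[] args) {
        List<Hit> hits = Arrays.asList(
                new Hit("Lucene in Action", 2.0f),
                new Hit("Search Basics", 1.5f),
                new Hit("Lucene in Action", 1.0f));
        for (Hit hit : dedupeByTitle(hits)) {
            System.out.println(hit.title + " " + hit.score);
        }
    }
}
```

Filtering after the search means a page of results can come back shorter than requested, so in practice one would probably fetch more hits than needed before deduplicating.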

Re: Analyzer.getPositionIncrementGap question

2006-10-26 Thread qaz zaq
to the SynonymFilter strategy. Hope this helps. Erick. On 10/25/06, qaz zaq wrote: I have multiple values that I want to add to the same field, and I also want a non-zero but non-constant position increment gap between those values, e.g., the gap between value1 and value2 is 10, but the gap between value2

Analyzer.getPositionIncrementGap question

2006-10-25 Thread qaz zaq
I have multiple values that I want to add to the same field, and I also want a non-zero but non-constant position increment gap between those values, e.g., the gap between value1 and value2 is 10, but the gap between value2 and value3 is 100. Is there any way I can achieve that? d.add(new Field
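
Since Analyzer.getPositionIncrementGap(String fieldName) returns a single value per field name, a per-value gap has to come from somewhere else, for example from the position increment of the first token of each value. The sketch below only works through the arithmetic in plain Java. `VariableGapPositions` and `positions` are hypothetical names, and it assumes each gap is added on top of the normal increment of 1.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper that illustrates the position arithmetic only;
// it does not call Lucene.
class VariableGapPositions {
    // values: the tokens of each value added to the same field, in order.
    // gaps[i]: extra gap between value i and value i + 1 (an assumption of this sketch).
    // Returns the absolute position of every token, in order.
    static List<Integer> positions(List<List<String>> values, int[] gaps) {
        if (gaps.length != Math.max(0, values.size() - 1)) {
            throw new IllegalArgumentException("need exactly one gap between consecutive values");
        }
        List<Integer> result = new ArrayList<>();
        int position = -1;  // so that the first token lands at position 0
        int pendingGap = 0; // gap still to be applied to the next token
        for (int v = 0; v < values.size(); v++) {
            if (v > 0) {
                pendingGap += gaps[v - 1];
            }
            for (int t = 0; t < values.get(v).size(); t++) {
                position += 1 + pendingGap; // normal increment plus any pending gap
                pendingGap = 0;
                result.add(position);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<String>> values = Arrays.asList(
                Arrays.asList("a", "b"),
                Arrays.asList("c"),
                Arrays.asList("d", "e"));
        System.out.println(positions(values, new int[] {10, 100}));
    }
}
```

With gaps of 10 and 100, the five tokens land at positions 0, 1, 12, 113, and 114. In a custom token stream, the same increments would be set on the first token of each value.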

Lucene scoring question (how to boost leading terms match)

2006-10-03 Thread qaz zaq
Hi, I have a question about Lucene scoring. In the following example, how can I ensure that doc1 gets a higher score than doc2 if I search for A*? In other words, I want to boost the docs that match on their leading terms. doc1: Aterm Bterm Cterm doc2: Bterm Aterm Cterm
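
The intended ranking can be illustrated outside Lucene with a position-weighted score: the earlier the first term matching the prefix appears, the higher the score. This is only a sketch of the idea and not Lucene's scoring formula. `LeadingTermBoost` and `leadingScore` are hypothetical names, and the 1 / (1 + index) weighting is an arbitrary choice made for illustration.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical re-ranking helper; it does not use Lucene's scoring.
class LeadingTermBoost {
    // Returns 1 / (1 + index of the first term that starts with the prefix),
    // or 0.0 when no term matches. Matching is case-insensitive.
    static double leadingScore(List<String> terms, String prefix) {
        String wanted = prefix.toLowerCase(Locale.ROOT);
        for (int i = 0; i < terms.size(); i++) {
            if (terms.get(i).toLowerCase(Locale.ROOT).startsWith(wanted)) {
                return 1.0 / (1 + i);
            }
        }
        return 0.0;
    }

    public static void main(String[] args) {
        List<String> doc1 = Arrays.asList("Aterm", "Bterm", "Cterm");
        List<String> doc2 = Arrays.asList("Bterm", "Aterm", "Cterm");
        System.out.println("doc1: " + leadingScore(doc1, "A"));
        System.out.println("doc2: " + leadingScore(doc2, "A"));
    }
}
```

For the two documents in the question, doc1 scores 1.0 and doc2 scores 0.5, so doc1 ranks first. Within Lucene itself, SpanFirstQuery restricts matches to the start of a field and may be a closer fit, although this sketch does not use it.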