RE: Relevance and ranking ...

2004-12-18 Thread Chuck Williams
The coord is the fraction of clauses matched in a BooleanQuery, so with your example of a 5-word BooleanQuery, the coord factors should be .4, .8, .8, 1.0 respectively for doc1, doc2, doc3 and doc4. One big issue you've got here is lengthNorm. Doc2 is 1/10 the size of doc4, so its lengthNorm is o

Re: Relevance and ranking ...

2004-12-18 Thread Gururaja H
Hi Erik, Created my own subclass of Similarity. When i printed the values for coord() factor i am getting the same for all the 4 documents. So the value is NOT getting boosted. Want to do this. as i want the document that has e.g., all three terms in a three word query over those that conta