I'm well aware that some queries will return no results due to my filtering by 0.3. That's the point. I expect that some of my input queries will not be a good match to *any* of the documents in my second index.
I'm really doing something much like the "Books Like This" example in Chapter 5 of Lucene in Action (which I saw after I wrote this). It is unfortunate that some scores are being normalized and some may not be. Is there a way to obtain the unnormalized score? Donna Gresh Chris Hostetter <[EMAIL PROTECTED]> 03/29/2007 06:26 PM Please respond to java-user@lucene.apache.org To java-user@lucene.apache.org cc Subject Re: normalized scores : For a given query (for a single input document), the highest score is : *not* always 1 (which is just how : I want it). Is this because I am using a Boolean query? Here is my code : snippet. the Hits class only normalizes scores if the highest score is greater then one, if it's less then 1 no normalization happens. as to your more general question... : Recent questions about whether/how scores are normalized got me wondering : how my application (happily) seems to be doing what I want. I have two it's all a question of what you want ... what you've got is throwing things out with a score less then 0.3 ... but that's an arbitrary decision -- there is no mathematical basis for assuming a documentwhich scores "0.31" agaisnt query A is better match on A then a doc which scores 0.29 against query B is for B ... they are apples and oranges. you can be as arbitrary as you want ... you could decide to ignore every even numbered hit if you want -- it's entirely your choice, but it's not a ratinal choice. BTW: i hope you realize based on your comment about not all Hits having a max score of 1, for some queries, the highest scoring doc might not even have a score above 0.3, in which case you would be ignoring all matches. -Hoss --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]