Re: normalized scores

Donna L Gresh Fri, 30 Mar 2007 04:48:55 -0800

I'm well aware that some queries will return no results due to my 
filtering by 0.3. 
That's the point. I expect that some of my input queries will not be a 
good match
to *any* of the documents in my second index.


I'm really doing something much like
the "Books Like This" example in Chapter 5 of Lucene in Action (which I 
saw after I wrote this). 
It is unfortunate that some scores are being normalized and some may not 
be. Is there a
way to obtain the unnormalized score?


Donna Gresh





Chris Hostetter <[EMAIL PROTECTED]> 
03/29/2007 06:26 PM
Please respond to
[email protected]


To
[email protected]
cc

Subject
Re: normalized scores






: For a given query (for a single input document), the highest score is
: *not* always 1 (which is just how
: I want it). Is this because I am using a Boolean query? Here is my code
: snippet.

the Hits class only normalizes scores if the highest score is greater then
one, if it's less then 1 no normalization happens.

as to your more general question...

: Recent questions about whether/how scores are normalized got me 
wondering
: how my application (happily) seems to be doing what I want. I have two

it's all a question of what you want ... what you've got is throwing
things out with a score less then 0.3 ... but that's an arbitrary
decision -- there is no mathematical basis for assuming a
documentwhich scores "0.31" agaisnt query A is better match on A then a
doc which scores 0.29 against query B is for B ... they are apples and
oranges.

you can be as arbitrary as you want ... you could decide to ignore every
even numbered hit if you want -- it's entirely your choice, but it's not a
ratinal choice.


BTW: i hope you realize based on your comment about not all Hits having a
max score of 1, for some queries, the highest scoring doc might not even
have a score above 0.3, in which case you would be ignoring all matches.


-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: normalized scores

Reply via email to