I would not recommend doing this because absolute score values in Lucene
are not meaningful (e.g., scores are not directly comparable across
searches).  The ratio of a score to the highest score returned is
meaningful, but at present there is no absolute calibration for the
highest score returned, so there is no way to determine from the scores
what the overall quality of the result set is.  Various approaches to
improving this have been discussed: making the scores more directly
comparable by encoding additional information into the score and using
that for normalization, or, probably better, generalizing the score to
an object that contains multiple pieces of information (e.g., the total
number of query terms matched by the top result would be quite useful
if you are using the default OR operator).  As far as I know, none of
these ideas has been implemented yet.
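
To illustrate the ratio idea, here is a minimal sketch of a cutoff that is
relative to the top score of one search rather than an absolute value such
as 0.05.  It is plain Java and deliberately does not use any Lucene classes;
the names RelativeScoreFilter, keepAboveRatio and minRatio are made up for
this example, and it assumes the scores arrive sorted in descending order.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: plain Java, no Lucene classes. All names here are
// hypothetical and exist just for this sketch.
class RelativeScoreFilter {

    // Returns the indexes of hits whose score is at least minRatio times the
    // top score. Assumes scores are sorted in descending order, so scores[0]
    // is the highest score returned by this one search.
    static List<Integer> keepAboveRatio(float[] scores, float minRatio) {
        List<Integer> kept = new ArrayList<>();
        if (scores.length == 0 || scores[0] <= 0f) {
            return kept; // no usable top score to compare against
        }
        float top = scores[0];
        for (int i = 0; i < scores.length; i++) {
            if (scores[i] / top >= minRatio) {
                kept.add(i);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        float[] scores = {0.8f, 0.4f, 0.05f};
        // Ratios to the top score are 1.0, 0.5 and 0.0625.
        System.out.println(keepAboveRatio(scores, 0.25f)); // prints [0, 1]
    }
}
```

Note that this only trims the tail of a single result list.  It still says
nothing about whether the top hit itself is a good match, which is the
calibration problem described above.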

Chuck

  > -----Original Message-----
  > From: Jay Hill [mailto:[EMAIL PROTECTED]
  > Sent: Monday, February 14, 2005 11:08 AM
  > To: lucene-user@jakarta.apache.org
  > Subject: Limiting Hits with a score threshold
  > 
  > Does anyone have an example of limiting results returned based on a
  > score threshold? For example if I'm only interested in documents
  > with a score > 0.05.
  > 
  > Thanks,
  > -Jay
  > 
  >
  > ---------------------------------------------------------------------
  > To unsubscribe, e-mail: [EMAIL PROTECTED]
  > For additional commands, e-mail: [EMAIL PROTECTED]

