Hi Philipp,

At 10:49 pm +0100 3/7/07, Paul Elschot wrote:
On Wednesday 07 March 2007 18:12, Philipp Nanz wrote:
 Thanks for your answers. Your input is really appreciated :-)

 @Paul Elschot:
 Thanks for the hint. I guess I could use coord() to penalize missing
 terms like this:

 Query: a b c d
 Doc A: a b c d => sloppyFreq(0) * coord(4, 4) = 1
 Doc B: a b c => sloppyFreq(0) * coord(3, 4) = 0,75

 Doc would score higher. I guess that might be a valid solution.

 There is a drawback though, i.e. sloppyFreq(1) * coord(4, 4) = 0,5

 So a perfect match with one insertion would score less than a 3 of 4
 match with no slop.

Your examples are based on DefaultSimilarity.
With a  Similarity in your Scorer you can leave the tradeoff between these
factors to the user of your query by letting them provide the Similarity
at query time.

[snip]

I'm curious if Paul's input here helped you finish your FuzzyPhraseQuery (or FuzzySpanQuery) addition to Lucene.

Thanks,

-- Ken
--
Ken Krugler
Krugle, Inc.
+1 530-210-6378
"If you can't find it, you can't fix it"

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to