I think I understand now. I also have evidence from literature. So I would say that my question is solved. :)
Thank you, Otis, and everybody else for contributing! Karl -------- Original-Nachricht -------- Datum: Thu, 14 Dec 2006 09:40:31 +0100 Von: Soeren Pekrul <[EMAIL PROTECTED]> An: java-user@lucene.apache.org Betreff: Re: Lucene scoring: coord_q_d factor > Karl Koch wrote: > > If I do not misunderstand that extract, I would say it suggests the > combination of coordination level matching with IDF. I am interested in your > view and those who read this? > > I understand that sentence: > "The natural solution is to correlate a term's matching value with its > collection frequency." > exactly in that way, to combine coordination level matching with IDF. > > The score for a document is the sum of the term weights w(tf, idf) for > each containing term. So you have already the combination of > coordination level matching with IDF. Now it is possible that your query > requests three terms A, B and C. Two of them (A and B) are quite often > in the collection one (C) is very rare. It could be possible that > documents are matching just C have a higher score than documents > containing A and B. To avoid this you can give the coordination a higher > influence by multiplying the sum of term weights with the coordination > as additional factor. > > Sören > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] -- "Ein Herz für Kinder" - Ihre Spende hilft! Aktion: www.deutschlandsegelt.de Unser Dankeschön: Ihr Name auf dem Segel der 1. deutschen America's Cup-Yacht! --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]