Re: Include BM25 in Lucene?

Vic Bancroft Tue, 17 Oct 2006 05:44:34 -0700

J.Zhu wrote:

If I would like to contribute, what should I do? I am not a good Java
developer myself though. Can I work with someone also interested?

In some of my group's usage of lucene over large document collections,we have split the documents across several machines. This has lead to aconcern of whether the inverse document frequency was appropriate, sincethe score seems to be dependant on the partioning of documents overindexing hosts. We have not formulated an experiment to determine if itseriously effects our results, though it has been discussed.

If someone could elaborate how BM25 or some DFR algorithm would differfrom what (TF/IDF) is implemented in lucene, I would be willing to helptranslate that into java as an indexing/searching option . . .


more,
l8r,
v


--
"The future is here. It's just not evenly distributed yet."
-- William Gibson, quoted by Whitfield Diffie


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Include BM25 in Lucene?

Reply via email to