Re: Include BM25 in Lucene?

2006-10-17 Thread Grant Ingersoll
Hi Jianhan, I am not aware, however, of anyone working on a BM25 implementation. We are a volunteer project, though, so we are always open to contributions! -Grant On Oct 17, 2006, at 5:50 AM, J.Zhu wrote: Hi, All, I am an enthusiastic user of Lucene and it is very helpful to my proje

RE: Include BM25 in Lucene?

2006-10-17 Thread J.Zhu
Subject: Re: Include BM25 in Lucene? Hi Jianhan, I am not aware, however, of anyone working on a BM25 implementation. We are a volunteer project, though, so we are always open to contributions! -Grant On Oct 17, 2006, at 5:50 AM, J.Zhu wrote: > Hi, All, > > I am an enthusiastic user

Re: Include BM25 in Lucene?

2006-10-17 Thread Vic Bancroft
J.Zhu wrote: If I would like to contribute, what should I do? I am not a good Java developer myself though. Can I work with someone also interested? In some of my group's usage of lucene over large document collections, we have split the documents across several machines. This has lead to a

RE: Include BM25 in Lucene?

2006-10-17 Thread J.Zhu
EMAIL PROTECTED] Sent: 17 October 2006 13:44 To: java-dev@lucene.apache.org; [EMAIL PROTECTED] Subject: Re: Include BM25 in Lucene? J.Zhu wrote: >If I would like to contribute, what should I do? I am not a good Java >developer myself though. Can I work with someone also interested? >

Re: Include BM25 in Lucene?

2006-10-17 Thread Chuck Williams
Vic Bancroft wrote on 10/17/2006 02:44 AM: > In some of my group's usage of lucene over large document collections, > we have split the documents across several machines. This has lead to > a concern of whether the inverse document frequency was appropriate, > since the score seems to be dependant

Re: Include BM25 in Lucene?

2006-10-19 Thread Vic Bancroft
Chuck Williams wrote: Vic Bancroft wrote on 10/17/2006 02:44 AM: In some of my group's usage of lucene over large document collections, we have split the documents across several machines. This has lead to a concern of whether the inverse document frequency was appropriate, since the score