Re: implementation of the state-of-art retrieval models for lucene?

José Ramón Pérez Agüera Fri, 02 Feb 2007 01:55:01 -0800

Dear Hui,



I'm a Ph.D. student at the Complutense University of Madrid (Spain),
where I am also a teaching assistant in the Department of Artificial
Intelligence. I have been working with Lucene for two years, and I am
very interested in re-implementing certain classes (TermQuery,
TermScorer, DefaultSimilarity) to adapt them to state-of-the-art
information retrieval models such as BM25, LM, and DFR. I am also
working on an evaluation module for Lucene that works with TREC
collections and similar test collections.



I think it would be a good idea to create a Lucene subproject to
develop new IR models and various tools aimed at the IR community.
I would be very interested in this, and I think it would be valuable
not only for the IR community but also for the Lucene community.



What do you think about this idea?



Best



jose

----- Original message -----
From: Hui Fang <[EMAIL PROTECTED]>
Date: Friday, February 2, 2007 5:45 am
Subject: implementation of the state-of-art retrieval models for lucene?
To: [email protected]

> Dear all,
> 
> My primary research interest is Information retrieval, with a focus on
> developing effective and robust retrieval models. I am happy to send my
> first email to Lucene community.
> 
> Lucene and nutch are really useful IR systems. But I think that the
> current retrieval function implemented in Lucene does not perform as
> well as other state-of-art retrieval functions in terms of
> effectiveness.  I have implemented some state-of-art models (such as
> pivoted normalization, okapi and axiomatic retrieval models) on top of
> Lucene, and evaluated these models and the default model implemented in
> Lucene using standard IR evaluation methodology. Experiments show that
> the state-of-art retrieval functions outperform the default one.
> Actually, this is one assignment my advisor and I designed for our IR
> course.
> 
> After posting this assignment online, quite a few IR researchers
> contacted us and asked for the code of our implementations.  So, we
> think that it might be beneficial to everyone in the lucene community
> and IR research community, if we could contribute our implementation of
> the state-of-art retrieval functions to Lucene. I think that our
> contribution could help improve the retrieval performance for both
> Lucene and nutch.
> 
> What do you think?
> 
> Thanks,
> -Hui
> 

José Ramón Pérez Agüera

Dept. de Ingeniería del Software e Inteligencia Artificial
Office 411, tel. 913947599
Facultad de Informática
Universidad Complutense de Madrid
