MoreLikeThis for multiple documents

2007-07-25 Thread Jens Grivolla
Hello, I'm looking to extract significant terms characterizing a set of documents (which in turn relate to a topic). This basically comes down to functionality similar to determining the terms with the greatest offer weight (as used for blind relevance feedback), or maximizing tf.idf (as is

Re: MoreLikeThis for multiple documents

2007-07-26 Thread Grant Ingersoll
I have some sample code for doing relevance feedback across multiple documents at http://www.cnlp.org/apachecon2005. It could be modified to provide more of the MoreLikeThis functionality (i.e. determining important terms via tf/idf); for now it just takes the top X terms. -Grant On Jul 25,
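To make the tf/idf idea discussed here concrete, the following is a minimal sketch in plain Python. It is not the sample code linked above and does not use the Lucene MoreLikeThis API. The function name `significant_terms`, the token-list input format, and the smoothed idf formula are all assumptions chosen for illustration. It sums term frequencies over the selected document set and weights each term by an inverse document frequency computed over the whole corpus.

```python
import math
from collections import Counter


def significant_terms(selected_docs, corpus, top_n=5):
    """Rank terms of `selected_docs` by aggregated tf * idf.

    Hypothetical helper: each document is a list of already-tokenized
    terms, and `corpus` is the full collection used for the idf statistics.
    """
    n_docs = len(corpus)

    # Document frequency: number of corpus documents containing each term.
    df = Counter()
    for doc in corpus:
        df.update(set(doc))

    # Term frequency summed over every document in the selected set.
    tf = Counter()
    for doc in selected_docs:
        tf.update(doc)

    scores = {}
    for term, freq in tf.items():
        # Smoothed idf (an assumption); avoids division by zero for
        # terms that do not occur in the corpus.
        idf = math.log((n_docs + 1) / (df.get(term, 0) + 1)) + 1.0
        scores[term] = freq * idf

    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


if __name__ == "__main__":
    corpus = [
        ["lucene", "index", "search"],
        ["lucene", "query", "search"],
        ["cooking", "recipe", "search"],
        ["garden", "recipe", "soil"],
    ]
    topic_docs = corpus[:2]  # the documents that characterize the topic
    for term, score in significant_terms(topic_docs, corpus, top_n=4):
        print(f"{term}\t{score:.3f}")
```

Taking only the top X terms by raw frequency corresponds to dropping the idf factor. With it, terms that are frequent in the selected set but also common across the whole corpus are ranked lower relative to rarer ones.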

Re: MoreLikeThis for multiple documents

2007-07-26 Thread Mathieu Lecarme
Jens Grivolla wrote:
> Hello,
>
> I'm looking to extract significant terms characterizing a set of
> documents (which in turn relate to a topic).
>
> This basically comes down to functionality similar to determining the
> terms with the greatest offer weight (as used for blind relevance
> feedba