MoreLikeThis for multiple documents

Jens Grivolla Wed, 25 Jul 2007 12:16:45 -0700

Hello,

I'm looking to extract significant terms characterizing a set of documents (which in turn relate to a topic).

This basically comes down to functionality similar to determining the terms with the greatest offer weight (as used for blind relevance feedback), or maximizing tf.idf (as is done in MoreLikeThis).

Is there anything like this already implemented, or do I need to iterate through all documents in the set "manually", re-tokenize each one (or maybe use TermVectors), and then calculate the weight for each term?
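
To make the "manual" route concrete, here is a rough sketch of the weighting step in plain Python. It is not Lucene code: the function name, the token lists, and the `doc_freq` / `num_docs` inputs are all hypothetical stand-ins for what would come from re-tokenizing the documents (or reading their term vectors) and from the index statistics. The idf form, log(N / (df + 1)) + 1, is just one common variant chosen for illustration.

```python
import math
from collections import Counter


def significant_terms(doc_set, doc_freq, num_docs, top_n=10):
    """Rank the terms of a document set by summed tf * idf.

    doc_set  -- list of token lists, one per document in the set (hypothetical input)
    doc_freq -- dict: term -> number of documents in the whole index containing it
    num_docs -- total number of documents in the whole index
    """
    # Term frequency accumulated over every document in the set.
    tf = Counter()
    for tokens in doc_set:
        tf.update(tokens)

    scores = {}
    for term, freq in tf.items():
        df = doc_freq.get(term, 0)
        # Assumed idf variant; terms that are rare in the index score higher.
        idf = math.log(num_docs / (df + 1)) + 1.0
        scores[term] = freq * idf

    # Highest score first; ties broken alphabetically for a stable order.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_n]
```

Called with the token lists for the topic's documents, this returns (term, score) pairs with the most characteristic terms first. An offer-weight style ranking would replace only the scoring line inside the loop.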


Thanks,
   Jens

