Hi,
I'm new to Lucene, but I did spend quite some time trying to find an answer
to the problem before turning to you gurus for help.
I'm given a text file. My task is to extract some top keywords from the
file so that these words can describe this document. Ideally, terms with the
highest tfidf should be returned. I noticed lucene provides a method called
QueryTermExtractor.getIdfWeightedTerms(), but to use this method, I need to
provide a query and an IndexReader. In my case, I only have a text file, and
don't have a handle on any IndexReader. Is there any way to indicate I want to
use the "default IndexReader", where the default reader represents a typical
collection of documents. Also, do I need to construct a Query out of the text
file? If so, is the best choice a multi-term query?
Any suggestions?
Thanks,
Ted
____________________________________________________________________________________
Sucker-punch spam with award-winning protection.
Try the free Yahoo! Mail Beta.
http://advision.webevents.yahoo.com/mailbeta/features_spam.html