keywords in a document

Ted Chen Mon, 09 Apr 2007 15:36:25 -0700

Hi, 
    I'm new to Lucene, but I did spend quite some time trying to find an answer 
to the problem before turning to you gurus for help.
    I'm given a text file.  My task is to extract some top keywords from the 
file so that these words can describe this document.  Ideally, terms with the 
highest tfidf should be returned.   I noticed lucene provides a method called 
QueryTermExtractor.getIdfWeightedTerms(), but to use this method, I need to 
provide a query and an IndexReader.  In my case, I only have a text file, and 
don't have a handle on any IndexReader.  Is there any way to indicate I want to 
use the "default IndexReader", where the default reader represents a typical 
collection of documents.  Also, do I need to construct a Query out of the text 
file?  If so, is the best choice a multi-term query?
    Any suggestions?


Thanks,
Ted


 
____________________________________________________________________________________
Sucker-punch spam with award-winning protection. 
Try the free Yahoo! Mail Beta.
http://advision.webevents.yahoo.com/mailbeta/features_spam.html

keywords in a document

Reply via email to