Hi,
I want to use Nutch for crawling and Lucene to extract and analyze the
contents of the index created by Nutch. I'm trying to extract the contents
of web pages from the index, but I don't know how to set the
NutchDocumentAnalyzer in my application. If I use Lucene's StandardAnalyzer,
I can extract the "title" and "url" fields but not the "content" field.
I'm using Nutch 1.0 and Lucene 2.4.0.


-- 
View this message in context: 
http://www.nabble.com/Using-Nutch-for-crawling-and-Lucene-for-searching-%28Wildcard-Fuzzy%29-tp19990219p23536068.html
Sent from the Nutch - User mailing list archive at Nabble.com.
