Hello,


I have a few questions regarding Nutch and was hoping that some kind
individual might point me in the right direction.



After running "bin/nutch crawl ...." I would like to access the contents
of the fetched pages programmatically for further processing. I would then
like to update the database and the index with the post-processed pages.



I have looked through some of the source code as well as the Javadocs,
but I am unable to determine which classes will help me access the page
contents from the database.



Also, is it possible to update the database and index after processing
the fetched pages? If so, what would this require?



Thanks for the help.



Chris

_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
