Hello,
I have a few questions regarding nutch and was hoping that some kind individual might point me in the right direction. After running "bin/nutch crawl ...." I would like to access the contents of the fetched pages programmaticaly for further processing. I would then like to update the database and the index with the post-processed pages. I have looked through some of the source code as well as the java docs, however I am unable to determine which classes will help me access the page contents from the database. Also, is it possible to update the database and index after processing the fetched pages? If yes, what may this require? Thanks for the help. Chris _______________________________________________ No banners. No pop-ups. No kidding. Make My Way your home on the Web - http://www.myway.com ------------------------------------------------------- This SF.Net email sponsored by Black Hat Briefings & Training. Attend Black Hat Briefings & Training, Las Vegas July 24-29 - digital self defense, top technical experts, no vendor pitches, unmatched networking opportunities. Visit www.blackhat.com _______________________________________________ Nutch-developers mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-developers
