Howie Wang wrote:
I was wondering how to recover from a bad fetch. Should I consider the segment corrupt and just delete it? Then should I reset the fetch date in the webdb so that it will refetch it?
Unfinished segments are ok, you can use them for further processing. Of course, the parts that are not fetched won't be processed at all, so those Pages in WebDB won't get updated and you will have to wait another week (or use -adddays).
-- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
