Dedup process are quite usefull, unfortunetely the url of the content
deleted are not removed from the Crawldb.
Don't u think we could either remove it from the DB or change the status and
fetchinterval to avoid to fetch it again so quickly ?
-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems? Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general