Unable to add/update new files to fetchlist/fetcher and thus index, when u
rerun crawl tool on same db.
-------------------------------------------------------------------------------------------------------
Key: NUTCH-154
URL: http://issues.apache.org/jira/browse/NUTCH-154
Project: Nutch
Type: Bug
Components: fetcher
Versions: 0.7.1
Environment: windows XP pro, nutch 0.7.1, jdk1.4.2 etc.
Reporter: Arun Kumar Sharma
Priority: Blocker
I have modified crawl tool to rerun on same db, I am facing problem when
re-running the crawl tool. Problem I am facing is that it is unable to
fetch/crawl the files which are new additions to the urls. Can anyone suggest
what is possible remedy for that.
with thanx
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira
-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems? Stop! Download the new AJAX search engine that makes
searching your log files as easy as surfing the web. DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers