Hey guys, I'm running a crawl over my lighttpd server, and nutch is repeatedly fetching the pages specified in the urls directory, and making no progress from there. It is crawling the tomcat server fine, just not the lighttpd one. Has anyone come across this problem before? I've run using 0.8 and 0.9, with the same results.
If you would like to see any of my configuration settings just ask. ------------------------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
