I am a complete beginner with Nutch. I know it is possible to do world wide
web crawling with Nutch, but how do I do it? So far I have tried to follow
tutorials on the web such as "Introduction to Nutch...", but for some reason
my log file shows "Input directory in local is invalid". I assume that I am
doing an intranet crawl instead of an internet crawl, and that this is what
causes the exception. How do I do an internet crawl? Any help will be
appreciated. Thanks.
-- 
View this message in context: 
http://www.nabble.com/Nutch-world-wide-web-crawling-tf3785927.html#a10706430
Sent from the Nutch - User mailing list archive at Nabble.com.


