I am a complete beginner with Nutch. I know it is possible to crawl the whole web with Nutch, but how do I do it? So far I have tried to follow tutorials on the web such as "Introduction to Nutch...", but my log file shows the error "Input directory in local is invalid". I assume this exception occurs because I am doing an intranet crawl instead of an internet crawl, but how do I do an internet crawl? Any help will be appreciated. Thanks.
