Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by ThiloPfennig: http://wiki.apache.org/nutch/GettingNutchRunningWithFedoraCore ------------------------------------------------------------------------------ * Test using http://lucene.apache.org/nutch/tutorial.html + 1. make a new dir `urls` - 1. add an url in a new file "urls" + 1. add an url in a new file 'urls/nutch' - 1. add/edit conf/crawl-urlfilter.txt (under # accept hosts in MY.DOMAIN.NAME ) + 1. add/edit `conf/crawl-urlfilter.txt' (under # accept hosts in MY.DOMAIN.NAME ) - - - '''result:''' - {{{ - Exception in thread "main" java.io.IOException: Input directory /home/vinci/Down - loads/nutch-0.8/urls in local is invalid. - at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274) - at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327) - at org.apache.nutch.crawl.Injector.inject(Injector.java:138) - at org.apache.nutch.crawl.Crawl.main(Crawl.java:105) - }}} + - --- + ---- <<< FrontPage