For some reason the nutch process can't resolve the hosts. This could be due to incorrect setup of dns on the machine or a firewall or proxy in place. See if you can ping one of the urls (hosts) that you are trying to fetch.
Dennis Kubes Reza Harditya wrote: > Hi, > > I'm a new nutch user. Currently I'm using Nutch 0.8.1. When I wanted to > start crawling according to the tutorial, I always get the following error: > > Injector: starting > Injector: crawlDb: crawl2/crawldb > Injector: urlDir: urls > Injector: Converting injected urls to crawl db entries. > Exception in thread "main" java.io.IOException: Job failed! > at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:357) > at org.apache.nutch.crawl.Injector.inject(Injector.java:138) > at org.apache.nutch.crawl.Crawl.main(Crawl.java:105) > ------------------------------------------------------------------------------------------------------------ > > > > From the log, I found a more detailed description which is: > > 2007-05-14 09:32:57,977 INFO crawl.Injector - Injector: starting > 2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: crawlDb: > crawl2/crawldb > 2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: urlDir: urls > 2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: Converting > injected > urls to crawl db entries. > 2007-05-14 09:32:58,908 WARN mapred.LocalJobRunner - job_lzlk81 > java.lang.RuntimeException: java.net.UnknownHostException: dhcppc0: dhcppc0 > at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java > :76) > at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java > :89) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:77) > at org.apache.hadoop.mapred.LocalJobRunner$Job.run( > LocalJobRunner.java:91) > Caused by: java.net.UnknownHostException: dhcppc0: dhcppc0 > at java.net.InetAddress.getLocalHost(InetAddress.java:1308) > at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java > :73) > ... 3 more > > > At first I suspect that the error was caused by tomcat not running > properly, > but after doing some checking I am confirmed that tomcat is indeed running. > > Could somebody let me know what I might be doing wrong here? > > Cheers, > ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
