For some reason the Nutch process can't resolve the hosts.  This could 
be due to an incorrect DNS setup on the machine, or to a firewall or 
proxy in place.  See if you can ping one of the hosts in the URLs that 
you are trying to fetch.
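
If ping is not conclusive, a small Java check may help, since it goes 
through the same JVM resolver that Nutch uses. Note that the quoted trace 
fails inside InetAddress.getLocalHost(), so it may be worth testing the 
machine's own host name (dhcppc0 in the log) as well as the hosts being 
fetched. This is only a rough sketch; the class name HostCheck and its 
two helper methods are made up for illustration and are not part of 
Nutch or Hadoop.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical helper, not part of Nutch or Hadoop.
class HostCheck {

    // Returns true if the given host name (or literal IP address) resolves.
    static boolean canResolve(String host) {
        try {
            InetAddress.getByName(host);
            return true;
        } catch (UnknownHostException e) {
            return false;
        }
    }

    // Returns true if the machine's own host name resolves. This is the
    // call that throws UnknownHostException in the quoted stack trace.
    static boolean localHostResolves() {
        try {
            InetAddress.getLocalHost();
            return true;
        } catch (UnknownHostException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("local host name resolves: " + localHostResolves());
        // Any host names passed on the command line are checked too.
        for (String host : args) {
            System.out.println(host + " resolves: " + canResolve(host));
        }
    }
}
```

Run it with the hosts from your urls directory as arguments. If the 
local host name is the one that fails, a common remedy is to add an 
entry for that name to the machine's hosts file (for example /etc/hosts 
on Linux), though the right fix depends on how your network is set up.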

Dennis Kubes

Reza Harditya wrote:
> Hi,
> 
> I'm a new Nutch user. Currently I'm using Nutch 0.8.1. When I try to
> start crawling according to the tutorial, I always get the following error:
> 
> Injector: starting
> Injector: crawlDb: crawl2/crawldb
> Injector: urlDir: urls
> Injector: Converting injected urls to crawl db entries.
> Exception in thread "main" java.io.IOException: Job failed!
>        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:357)
>        at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
>        at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)
> ------------------------------------------------------------------------------------------------------------
>  
> 
> 
> In the log, I found a more detailed description:
> 
> 2007-05-14 09:32:57,977 INFO  crawl.Injector - Injector: starting
> 2007-05-14 09:32:57,978 INFO  crawl.Injector - Injector: crawlDb: crawl2/crawldb
> 2007-05-14 09:32:57,978 INFO  crawl.Injector - Injector: urlDir: urls
> 2007-05-14 09:32:57,978 INFO  crawl.Injector - Injector: Converting injected urls to crawl db entries.
> 2007-05-14 09:32:58,908 WARN  mapred.LocalJobRunner - job_lzlk81
> java.lang.RuntimeException: java.net.UnknownHostException: dhcppc0: dhcppc0
>        at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java:76)
>        at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java:89)
>        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:77)
>        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:91)
> Caused by: java.net.UnknownHostException: dhcppc0: dhcppc0
>        at java.net.InetAddress.getLocalHost(InetAddress.java:1308)
>        at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java:73)
>        ... 3 more
> 
> 
> At first I suspected that the error was caused by Tomcat not running
> properly, but after some checking I have confirmed that Tomcat is
> indeed running.
> 
> Could somebody let me know what I might be doing wrong here?
> 
> Cheers,
> 

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
