Hi,
I'm a new nutch user. Currently I'm using Nutch 0.8.1. When I wanted to
start crawling according to the tutorial, I always get the following error:
Injector: starting
Injector: crawlDb: crawl2/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:357)
at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)
------------------------------------------------------------------------------------------------------------
From the log, I found a more detailed description which is:
2007-05-14 09:32:57,977 INFO crawl.Injector - Injector: starting
2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: crawlDb:
crawl2/crawldb
2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: urlDir: urls
2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: Converting injected
urls to crawl db entries.
2007-05-14 09:32:58,908 WARN mapred.LocalJobRunner - job_lzlk81
java.lang.RuntimeException: java.net.UnknownHostException: dhcppc0: dhcppc0
at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java
:76)
at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java
:89)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:77)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(
LocalJobRunner.java:91)
Caused by: java.net.UnknownHostException: dhcppc0: dhcppc0
at java.net.InetAddress.getLocalHost(InetAddress.java:1308)
at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java
:73)
... 3 more
At first I suspect that the error was caused by tomcat not running properly,
but after doing some checking I am confirmed that tomcat is indeed running.
Could somebody let me know what I might be doing wrong here?
Cheers,
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general