Caused by: java.net.UnknownHostException: dhcppc0: dhcppc0
at java.net.InetAddress.getLocalHost(InetAddress.java:1308)
at org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java
:73)
Could it be that it is because I have an installation of apache and tomcat
in the host that I've installed Nutch and it cannot determine whether
'localhost' points to the apache or tomcat? Or does it matter anyway?
I have both servers(apache and tomcat) listening on the default port# which
is 80 and 8080.
On 5/14/07, Reza Harditya <[EMAIL PROTECTED]> wrote:
I have checked and confirmed that the hosts I'm trying to fetch are
actually accessible (ping requests and loading the site itself). However, I
still get the same error.
Any other alternatives?
On 5/14/07, Dennis Kubes <[EMAIL PROTECTED]> wrote:
>
> For some reason the nutch process can't resolve the hosts. This could
> be due to incorrect setup of dns on the machine or a firewall or proxy
> in place. See if you can ping one of the urls (hosts) that you are
> trying to fetch.
>
> Dennis Kubes
>
> Reza Harditya wrote:
> > Hi,
> >
> > I'm a new nutch user. Currently I'm using Nutch 0.8.1. When I wanted
> to
> > start crawling according to the tutorial, I always get the following
> error:
> >
> > Injector: starting
> > Injector: crawlDb: crawl2/crawldb
> > Injector: urlDir: urls
> > Injector: Converting injected urls to crawl db entries.
> > Exception in thread "main" java.io.IOException : Job failed!
> > at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java
> :357)
> > at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
> > at org.apache.nutch.crawl.Crawl.main (Crawl.java:105)
> >
>
------------------------------------------------------------------------------------------------------------
> >
> >
> > From the log, I found a more detailed description which is:
> >
> > 2007-05-14 09:32:57,977 INFO crawl.Injector - Injector: starting
> > 2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: crawlDb:
> > crawl2/crawldb
> > 2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: urlDir: urls
> > 2007-05-14 09:32:57,978 INFO crawl.Injector - Injector: Converting
> > injected
> > urls to crawl db entries.
> > 2007-05-14 09:32:58,908 WARN mapred.LocalJobRunner - job_lzlk81
> > java.lang.RuntimeException: java.net.UnknownHostException: dhcppc0:
> dhcppc0
> > at org.apache.hadoop.io.SequenceFile$Writer.<init>(
> SequenceFile.java
> > :76)
> > at org.apache.hadoop.io.SequenceFile$Writer .<init>(
> SequenceFile.java
> > :89)
> > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:77)
> > at org.apache.hadoop.mapred.LocalJobRunner$Job.run(
> > LocalJobRunner.java:91)
> > Caused by: java.net.UnknownHostException: dhcppc0: dhcppc0
> > at java.net.InetAddress.getLocalHost(InetAddress.java:1308)
> > at org.apache.hadoop.io.SequenceFile$Writer.<init>(
> SequenceFile.java
> > :73)
> > ... 3 more
> >
> >
> > At first I suspect that the error was caused by tomcat not running
> > properly,
> > but after doing some checking I am confirmed that tomcat is indeed
> running.
> >
> > Could somebody let me know what I might be doing wrong here?
> >
> > Cheers,
> >
>
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general