Hi guys,
I have been running Nutch without problems for the past few days. Now, when
I run the same program again, I keep getting the errors below. I never
had to configure hadoop-site.xml. I am running Nutch 0.8.1 on Windows
through Ant and a Java program, and it always worked until today. I have
downloaded a fresh copy of Nutch and the same error keeps appearing. When
I run Nutch through Cygwin, the same problem appears. The hadoop-site.xml
is empty, but I understand that if I'm not running in a distributed
environment I should not need to configure it. I am running Nutch on my
local file system. Could someone give me a hand with this?
Thanks in advance.
2006-11-22 17:39:10,171 INFO crawl.LinkDb - LinkDb: starting
2006-11-22 17:39:10,171 INFO crawl.LinkDb - LinkDb: linkdb: testcrawl/linkdb
2006-11-22 17:39:10,250 WARN mapred.LocalJobRunner - job_gpz1p7
java.io.IOException: No input directories specified in: Configuration: defaults: hadoop-default.xml , mapred-default.xml , /tmp/hadoop/mapred/local/localRunner/job_gpz1p7.xmlfinal: hadoop-site.xml
    at org.apache.hadoop.mapred.InputFormatBase.listPaths(InputFormatBase.java:96)
    at org.apache.hadoop.mapred.SequenceFileInputFormat.listPaths(SequenceFileInputFormat.java:37)
    at org.apache.hadoop.mapred.InputFormatBase.getSplits(InputFormatBase.java:116)
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:80)
Armel
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers