Hi all
I get a error message and do not know what's not working? Is the error
that the fetcher seeks for segment "ls"?
I just followed the manual...
Thx for your help
Josa
r...@ubuntu-904-jaunty-32-minimal:/home/nutch# s1='ls -d
crawl/segments/2* | tail -1'
r...@ubuntu-904-jaunty-32-minimal:/home/nutch# bin/nutch fetch $s1
Fetcher: Your 'http.agent.name' value should be listed first in
'http.robots.agents' property.
Fetcher: starting
Fetcher: segment: ls
Exception in thread "main"
org.apache.hadoop.mapred.InvalidInputException: Input path does not
exist: file:/home/nutch/ls/crawl_generate
at
org.apache.hadoop.mapred.FileInputFormat.listStatus(FileInputFormat.java:179)
at
org.apache.hadoop.mapred.SequenceFileInputFormat.listStatus(SequenceFileInputFormat.java:39)
at
org.apache.nutch.fetcher.Fetcher$InputFormat.getSplits(Fetcher.java:101)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:797)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1142)
at org.apache.nutch.fetcher.Fetcher.fetch(Fetcher.java:969)
at org.apache.nutch.fetcher.Fetcher.main(Fetcher.java:1003)
r...@ubuntu-904-jaunty-32-minimal:/home/nutch#