I start namenode, datenode, jobtracker, tasktracker. And when I try
start commands:

 

1) echo http://cnn.com/  > ./urldir/urls

2) bin/nutch ndfs -put ./urldir /urldir

3) bin/nutch inject /db -urlfile /urldir/urls

 

on last command I get error:

 

051120 090321 Injector: starting

051120 090321 Injector: crawlDb: /db

051120 090321 Injector: urlDir: -urlfile

051120 090321 Injector: Converting injected urls to crawl db entries.

051120 090323 parsing file:/spider_mapred/spider/conf/nutch-default.xml

051120 090323 parsing file:/spider_mapred/spider/conf/mapred-default.xml

051120 090323 parsing file:/spider_mapred/spider/conf/nutch-site.xml

051120 090324 parsing file:/spider_mapred/spider/conf/nutch-default.xml

051120 090324 parsing file:/spider_mapred/spider/conf/nutch-site.xml

051120 090324 Client connection to 192.168.0.250:9010: starting

051120 090324 Client connection to 192.168.0.250:9009: starting

Exception in thread "main" java.io.IOException: No input directories
specified in: NutchConf: nutch-default.xml , mapred-default.xml ,
/spider_mapred/spider/local/jobTracker/job_hmkczb.xml , nutch-site.xml

        at org.apache.nutch.ipc.Client.call(Client.java:294)

        at org.apache.nutch.ipc.RPC$Invoker.invoke(RPC.java:127)

        at $Proxy0.submitJob(Unknown Source)

        at
org.apache.nutch.mapred.JobClient.submitJob(JobClient.java:259)

        at org.apache.nutch.mapred.JobClient.runJob(JobClient.java:288)

        at org.apache.nutch.crawl.Injector.inject(Injector.java:101)

        at org.apache.nutch.crawl.Injector.main(Injector.java:125)

 

what I do make not thus?

 

Reply via email to