I got the same error when I ran it in my Cygwin environment. When I ran it in Eclipse on Windows instead, it ran OK, although I still have some other nutch-0.9 issues to deal with. Please read the following web pages: http://wiki.apache.org/nutch/RunNutchInEclipse and http://lucene.apache.org/nutch/tutorial8.html, then run it again.
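The stack trace in the original message fails inside java.net.Socket.connect, called from the Hadoop IPC client. To check from outside Nutch whether a given host and port accept TCP connections from the same machine, something like the following minimal sketch may help. The class name RouteCheck, the method canConnect, and the default host and port are hypothetical and not taken from Nutch or Hadoop; substitute whatever host and port your own configuration points to.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

public class RouteCheck {
    /** Returns true if a TCP connection to host:port opens within timeoutMs. */
    public static boolean canConnect(String host, int port, int timeoutMs) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMs);
            return true;
        } catch (IOException e) {
            // NoRouteToHostException, ConnectException, UnknownHostException and
            // SocketTimeoutException are all subclasses of IOException.
            System.err.println(host + ":" + port + " unreachable: " + e);
            return false;
        }
    }

    public static void main(String[] args) {
        // Hypothetical defaults, for illustration only.
        String host = args.length > 0 ? args[0] : "localhost";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 9000;
        System.out.println(canConnect(host, port, 3000) ? "reachable" : "not reachable");
    }
}
```

If this reports the same "No route to host" from the Cygwin shell but not from a Windows command prompt, the problem is more likely in the network or environment setup than in Nutch itself.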
Adam Shuy, President
ePacific Web Design & Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com

-----Original Message-----
From: DANIEL CLARK [mailto:[EMAIL PROTECTED]
Sent: Friday, June 29, 2007 1:07 PM
To: Nutch List
Subject: NoRouteToHostException

I'm running 0.8.1 and I'm getting the following exception. Any help would be appreciated.

$ bin/nutch crawl urls -dir crawl -depth 3
crawl started in: crawl
rootUrlDir = urls
threads = 10
depth = 3
Injector: starting
Injector: crawlDb: crawl/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Exception in thread "main" java.net.NoRouteToHostException: No route to host
        at java.net.PlainSocketImpl.socketConnect(Native Method)
        at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
        at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
        at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
        at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
        at java.net.Socket.connect(Socket.java:519)
        at java.net.Socket.connect(Socket.java:469)
        at java.net.Socket.<init>(Socket.java:366)
        at java.net.Socket.<init>(Socket.java:208)
        at org.apache.hadoop.ipc.Client$Connection.<init>(Client.java:113)
        at org.apache.hadoop.ipc.Client.getConnection(Client.java:359)
        at org.apache.hadoop.ipc.Client.call(Client.java:297)
        at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:150)
        at org.apache.hadoop.mapred.$Proxy1.getFilesystemName(Unknown Source)
        at org.apache.hadoop.mapred.JobClient.getFs(JobClient.java:214)
        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:248)
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
        at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Daniel Clark, President
DAC Systems, Inc.
5209 Nanticoke Court
Centreville, VA 20120
Cell - (703) 403-0340
Email - [EMAIL PROTECTED]
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
