Hi,


I am no longer able to run "bin/nutch crawl ...". I am running  "nutch-0.4 built on 
25-May-2004 12:17".



Here is what I did after unpacking the release.

1) created a flat urls file with 1 entry

2) updated the conf/crawl-urlfilter.txt file appropriately

3) ran the command "bin/nutch crawl urls -dir crawl.test -depth 3"



here is the output:



[~/nutch-0.4]$bin/nutch crawl urls -dir crawl.test -depth 3

040701 191951 SEVERE nutch-default.xml not found

040701 191951 SEVERE crawl-tool.xml not found

040701 191951 crawl started in: crawl.test

040701 191951 rootUrlFile = urls

040701 191951 threads = 10

040701 191951 depth = 3

040701 191951 serverDelay = 1

040701 191951 Created webdb at crawl.test/db

040701 191951 Starting URL processing

040701 191952 Using URL filter: null

040701 191952 SEVERE error while injecting:java.lang.RuntimeException: Couldn't create 
null

040701 191952 FetchListTool started

040701 191952 Overall processing: Sorted 0 entries in 0.0 seconds.

040701 191952 Overall processing: Sorted NaN entries/second

040701 191952 FetchListTool completed

Exception in thread "main" java.lang.NullPointerException

        at java.util.StringTokenizer.<init>(StringTokenizer.java:146)

        at java.util.StringTokenizer.<init>(StringTokenizer.java:162)

        at net.nutch.fetcher.Fetcher.<init>(Fetcher.java:570)

        at net.nutch.fetcher.Fetcher.main(Fetcher.java:741)

        at net.nutch.tools.CrawlTool.main(CrawlTool.java:101)





 

Any ideas?



Thanks.

Chris







_______________________________________________
No banners. No pop-ups. No kidding.
Make My Way your home on the Web - http://www.myway.com


-------------------------------------------------------
This SF.Net email sponsored by Black Hat Briefings & Training.
Attend Black Hat Briefings & Training, Las Vegas July 24-29 - 
digital self defense, top technical experts, no vendor pitches, 
unmatched networking opportunities. Visit www.blackhat.com
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to