Hi,
I am no longer able to run "bin/nutch crawl ...". I am running "nutch-0.4 built on
25-May-2004 12:17".
Here is what I did after unpacking the release.
1) created a flat urls file with 1 entry
2) updated the conf/crawl-urlfilter.txt file appropriately
3) ran the command "bin/nutch crawl urls -dir crawl.test -depth 3"
here is the output:
[~/nutch-0.4]$bin/nutch crawl urls -dir crawl.test -depth 3
040701 191951 SEVERE nutch-default.xml not found
040701 191951 SEVERE crawl-tool.xml not found
040701 191951 crawl started in: crawl.test
040701 191951 rootUrlFile = urls
040701 191951 threads = 10
040701 191951 depth = 3
040701 191951 serverDelay = 1
040701 191951 Created webdb at crawl.test/db
040701 191951 Starting URL processing
040701 191952 Using URL filter: null
040701 191952 SEVERE error while injecting:java.lang.RuntimeException: Couldn't create
null
040701 191952 FetchListTool started
040701 191952 Overall processing: Sorted 0 entries in 0.0 seconds.
040701 191952 Overall processing: Sorted NaN entries/second
040701 191952 FetchListTool completed
Exception in thread "main" java.lang.NullPointerException
at java.util.StringTokenizer.<init>(StringTokenizer.java:146)
at java.util.StringTokenizer.<init>(StringTokenizer.java:162)
at net.nutch.fetcher.Fetcher.<init>(Fetcher.java:570)
at net.nutch.fetcher.Fetcher.main(Fetcher.java:741)
at net.nutch.tools.CrawlTool.main(CrawlTool.java:101)
Any ideas?
Thanks.
Chris
_______________________________________________
No banners. No pop-ups. No kidding.
Make My Way your home on the Web - http://www.myway.com
-------------------------------------------------------
This SF.Net email sponsored by Black Hat Briefings & Training.
Attend Black Hat Briefings & Training, Las Vegas July 24-29 -
digital self defense, top technical experts, no vendor pitches,
unmatched networking opportunities. Visit www.blackhat.com
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers