Hi Nutch Developers,
I am using Nutch to do behavior-crawling (made changes to nutch files and add
some of my own files) using java. All the files were coded and built using
Ecplise.
I am running Nutch on Tomcat and coded it with Java. However when i request
Nutch to start crawling, i encountered the following error:
LocalFileSystem: can't delete C:\Program Files\Apache\Tomcat
5.5\webapps\dataUTF8\BLOG_SINA\20070307235111\db\webdb.new\pagesByURL\data
WebDBWriter: serious problem at line 1677
Step 5: generate a new segment
LocalFileSystem: can't delete C:\Program Files\Apache\Tomcat
5.5\webapps\dataUTF8\BLOG_SINA\20070307235111\db\webdb.new\pagesByURL\data
WebDBWriter: serious problem at line 1677
Step 6: fetch
Exception in thread "fetcher0" java.lang.ExceptionInInitializerError
at org.apache.nutch.fetcher.Fetcher$FetcherThread.handleFetch(Fetcher.java:281)
at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:219)
Caused by: java.lang.RuntimeException: x point org.apache.nutch.parse.Parser
not found.
at org.apache.nutch.parse.ParserFactory.<clinit>(ParserFactory.java:40)
... 2 more
Step 7: update database
I have no idea how to resolve this issue. Pls advise. Thanks.
P/S: the plugins parameter in the Nutch-site.xml has been changed to the
absolute pathname of the folders which has all the plugins.
Best Regards,
Bing Jin, Khoo
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general