I am getting the error below when I try to run the inject step.
Here is what I did first:
mkdir dmoz
bin/nutch org.apache.nutch.tools.DmozParser content.rdf.u8 -subset 5000 > dmoz/urls
The error occurs when I then run inject:
bin/nutch inject crawl/crawldb dmoz
2007-06-02 02:37:19,796 WARN plugin.PluginRepository - Plugins: not a file: url. Can't load plugins from: jar:file:/C:/Berlin/Downloads4/workspaceTrunk/BotListProjects/botcrawl/nutch/nutch-0.9.job!/plugins
2007-06-02 02:37:19,812 INFO plugin.PluginRepository - Plugin Auto-activation mode: [true]
2007-06-02 02:37:19,812 INFO plugin.PluginRepository - Registered Plugins:
2007-06-02 02:37:19,812 INFO plugin.PluginRepository - NONE
2007-06-02 02:37:19,812 INFO plugin.PluginRepository - Registered Extension-Points:
2007-06-02 02:37:19,812 INFO plugin.PluginRepository - NONE
2007-06-02 02:37:19,812 WARN mapred.LocalJobRunner - job_5ysi6h
java.lang.RuntimeException: x point org.apache.nutch.net.URLNormalizer not found.
        at org.apache.nutch.net.URLNormalizers.<init>(URLNormalizers.java:120)
        at org.apache.nutch.crawl.Injector$InjectMapper.configure(Injecto
--
Berlin Brown
http://www.newspiritcompany.com - newspirit technologies
_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general