Hi there, I am using nutch-0.8.1 and I have 5 custom plugins that I am using. All of those plugins seem to get used from the logs but one of them is not being used. Also, the urls it was written for are also skipped altogether.
Here are some pieces from hadoop.log file 2009-05-07 14:27:41,227 INFO plugin.PluginRepository - Registered Plugins: ..... ......... 2009-05-07 14:27:41,228 INFO plugin.PluginRepository - Xenbase Indexer (index-xenbase) 2009-05-07 14:27:41,228 INFO plugin.PluginRepository - Article Display Page Parser (parse-articlePage) The last plugin --> parse-articlePage is never used. I wrote this plugin to index urls of the type http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=670 Again, these urls get fetched but never indexed. hadoop.log file shows 2009-05-07 14:32:23,048 INFO fetcher.Fetcher - fetching http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=5966 2009-05-07 14:32:23,049 INFO fetcher.Fetcher - fetching http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=9196 2009-05-07 14:32:23,051 INFO fetcher.Fetcher - fetching http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=6247 2009-05-07 14:32:23,052 INFO fetcher.Fetcher - fetching http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=6223 Am I missing some configuration, or is there a bug in the plugin, I don't see any exceptions being thrown. Thanks for any pointers. -- View this message in context: http://www.nabble.com/Registered-plugin-never-invoked-and-urls-skipped-tp23435093p23435093.html Sent from the Nutch - User mailing list archive at Nabble.com.
