Hi there,
I am using nutch-0.8.1 and I have 5 custom plugins that I am using. All of
those plugins seem to get used from the logs but one of them is not being
used. Also, the urls it was written for are also skipped altogether. 

Here are some pieces from hadoop.log file
2009-05-07 14:27:41,227 INFO  plugin.PluginRepository - Registered Plugins:
.....
.........
2009-05-07 14:27:41,228 INFO  plugin.PluginRepository -         Xenbase Indexer
(index-xenbase)
2009-05-07 14:27:41,228 INFO  plugin.PluginRepository -         Article Display
Page Parser (parse-articlePage)

The last plugin --> parse-articlePage is never used.

I wrote this plugin to index urls of the type 
http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=670

Again, these urls get fetched but never indexed.
hadoop.log file shows 
2009-05-07 14:32:23,048 INFO  fetcher.Fetcher - fetching
http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=5966
2009-05-07 14:32:23,049 INFO  fetcher.Fetcher - fetching
http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=9196
2009-05-07 14:32:23,051 INFO  fetcher.Fetcher - fetching
http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=6247
2009-05-07 14:32:23,052 INFO  fetcher.Fetcher - fetching
http://xlaevis.cpsc.ucalgary.ca/literature/article.do?method=display&articleId=6223

Am I missing some configuration, or is there a bug in the plugin, I don't
see any exceptions being thrown.

Thanks for any pointers.


-- 
View this message in context: 
http://www.nabble.com/Registered-plugin-never-invoked-and-urls-skipped-tp23435093p23435093.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to