Hi there,
Has anyone configured carrot2 with nutch 0.8.1. I have the following
configuration in nutch-default.xml for carrot2, but everytime I try to
acquire the OnlineClusterer class it comes back as null.
clusterer = new
OnlineClustererFactory(nutchConf).getOnlineClusterer();
<property>
<name>plugin.includes</name>
<value>clustering-carrot2|protocol-http|urlfilter-regex|parse-(text|html|js|pdf)|index-basic|query-(basic|site|url)|summary-basic|scoring-opic</value>
<description>Regular expression naming plugin directory names to
include. Any plugin not matching this expression is excluded.
In any case you need at least include the nutch-extensionpoints plugin. By
default Nutch includes crawling just HTML and plain text via HTTP,
and basic indexing and search plugins.
</description>
</property>
<property>
<name>extension.clustering.extension-name</name>
<value></value>
<description>Use the specified online clustering extension. If empty,
the first available extension will be used. The "name" here refers to an
'id'
attribute of the 'implementation' element in the plugin descriptor XML
file.</description>
</property>
Any ideas, comments, thanks.
--
View this message in context:
http://www.nabble.com/Carrot2-clustering-help-tp24905920p24905920.html
Sent from the Nutch - User mailing list archive at Nabble.com.