I have a question for the OpenNLP project team.

I was wondering if there is a Naive Bayesian classifier implementation in
OpenNLP that I've not come across, or if there are plans to implement one.

If it is the latter, I should love to contribute an implementation.

There is an ME classifier already available in OpenNLP, of course, but I
felt that there was an unmet need for a Naive Bayesian (NB) classifier
implementation to be offered as well.

An NB classifier could be bootstrapped up with partially labelled training
data as explained in the Nigam, McCallum, et al paper of 2000 "Text
Classification from Labeled and Unlabeled Documents using EM".

So, if there isn't an NB code base out there already, I'd be happy to
contribute a very solid implementation that we've used in production for a
good 5 years.

I'd have to adapt it to load the same training data format as the ME
classifier, but I guess that shouldn't be very difficult to do.

I was wondering if there was some interest in adding an NB implementation
and I'd love to know who could I coordinate with if there is?

Cohan Sujay Carlos
CEO, Aiaioo Labs, India
+91-77605-80015 +91-80-4125-0730

Reply via email to