I think you will have problems there.  The problem is that the iris data set
has 5 or so continuous variables and naive bayes really only likes sparse
binary features.

There is a patch at https://issues.apache.org/jira/browse/MAHOUT-228 that
gives you the beginnings of an online logistic regression classifier (you
will need to be ambitious to use that).

The random forest implementation already in mahout can handle continuous
variables as well.

Neither of these other two implementations is nearly as polished (yet) as
the Naive Bayes stuff.

On Tue, Jun 8, 2010 at 1:55 AM, Robin Anil <[email protected]> wrote:

> I am trying to use mahout's bayesian classifier over the iris dataset.
>

Reply via email to