You can try AdaBoost in patch 716 :)

On Jun 2, 2011, at 7:40 AM, "Lancaster, Robert (Orbitz)" <[email protected]> wrote:

> I'm looking at the Mahout implementation NaiveBayes for a classification
> task, but the language around the Mahout implementation appears to be
> document-centric. Is it possible to use the Mahout implementation of NB for
> a classification task that doesn't involve documents?
>
> I have about 80 million records with a small number of features. The arff
> header looks like (the numeric features could easily be nominalized if need
> be):
>
> @RELATION relation
> @ATTRIBUTE featurea NUMERIC
> @ATTRIBUTE featureb {1,2,3,4,5,6,7}
> @ATTRIBUTE featurec {1,2,3,4,5,6,7}
> @ATTRIBUTE featured NUMERIC
> @ATTRIBUTE featuref NUMERIC
> @ATTRIBUTE featuref {0,1}
> @ATTRIBUTE target {0,1}
