Random forest works as a map-reduce program, but that does not produce arbitrary scalability.
The Naive Bayes classifier is relatively natural as a map-reduce program and has a map-reduce version. The linear classifiers like linear regression do not have map-reduce versions (yet) since there is some difficulty in getting these to work well. On Sun, Dec 25, 2011 at 5:59 AM, Lingxiang Cheng <[email protected]>wrote: > Hi, > > I am a newbie to Mahout. When I was reading the book "Mahout in > Action", I found chapters talking about how clustering naturally fit into > Map/Reduce framework, but I did not see the same claim for classifiers. > Does it involve a lot of work to make classifiers like random forest work > with Hadoop? > > Thanks! > Lingxiang Cheng
