Yes, all of those nodes are running tasks. For Logistic Regression, that's
reasonable, since this algorithm has only a sequential implementation. But for
Naive Bayes and Random Forest, it's hard to understand. By the way, how can I
check whether I am running the distributed implementation of these algorithms?
Are there any command-line options or configuration settings for that? Thanks.

Dong


Sent from my iPad

> On May 27, 2014, at 8:22 PM, Jay Vyas <jayunit...@gmail.com> wrote:
> 
> Have you verified that all the slaves are running tasks? Sometimes only a
> few slaves on a cluster will pick up a task because of other limitations.
> Also, some algorithms in Mahout aren't distributed.
> And obviously you will want to make sure that you're running the distributed
> implementations of these algorithms.
> 
> 
>> On Tue, May 27, 2014 at 8:45 PM, dongdan39 <dongda...@gmail.com> wrote:
>> 
>> Hi, Expert,
>>  I'm confused about the runtime of Mahout on, e.g., Random Forest (the same
>> goes for Logistic Regression): whether I set the number of slaves to 2,
>> 8, or 20 in conf/slaves in Hadoop,
>> the runtime of the program is basically the same. Shouldn't it be faster
>> when the program runs on more machines? Any hints?
>> 
>>  Regards, Dong
> 
> 
> -- 
> Jay Vyas
> http://jayunit100.blogspot.com
