Engine specific algos

Pat Ferrel Wed, 18 Jun 2014 13:31:27 -0700

Taken from: Re: [jira] [Resolved] (MAHOUT-1153) Implement streaming random 
forests


> Also, we don't have any mappings for Spark Streaming -- so if your
> implementation heavily relies on Spark streaming, i think Spark itself is
> the right place for it to be a part of.

We are discouraging engine specific work? Even dismissing Spark Streaming as a 
whole?

> As it stands we don't have purely (c) methods and indeed i believe these
> methods may be totally engine-specific in which case mllib is one of
> possibly good homes for them. 

Adherence to a specific incarnation of an engine-neutral DSL has become a 
requirement for inclusion in Mahout? The current DSL cannot be extended? Or it 
can’t be extended with engine specific ways? Or it can’t be extended with Spark 
Streaming? I would have thought all of these things desirable otherwise we are 
limiting ourselves to a subset of what an engine can do or a subset of problems 
that the current DSL supports. 

I hope I’m misreading this but it looks like we just discourage a contributor 
from adding post hadoop code in an interesting area to Mahout?

Engine specific algos

Reply via email to