Taken from: Re: [jira] [Resolved] (MAHOUT-1153) Implement streaming random forests
> Also, we don't have any mappings for Spark Streaming -- so if your > implementation heavily relies on Spark streaming, i think Spark itself is > the right place for it to be a part of. We are discouraging engine specific work? Even dismissing Spark Streaming as a whole? > As it stands we don't have purely (c) methods and indeed i believe these > methods may be totally engine-specific in which case mllib is one of > possibly good homes for them. Adherence to a specific incarnation of an engine-neutral DSL has become a requirement for inclusion in Mahout? The current DSL cannot be extended? Or it can’t be extended with engine specific ways? Or it can’t be extended with Spark Streaming? I would have thought all of these things desirable otherwise we are limiting ourselves to a subset of what an engine can do or a subset of problems that the current DSL supports. I hope I’m misreading this but it looks like we just discourage a contributor from adding post hadoop code in an interesting area to Mahout?
