Streaming K-means

2015-06-01 Thread Marko Dinic
Hello everyone, I have an idea and I would like to get a validation from community about it. In Mahout there is an implementation of Streaming K-means. I'm interested in your opinion would it make sense to make a similar implementation of Streaming K-medoids? K-medoids has even bigger

Re: Regression using MapReduce

2015-06-01 Thread Punit Naik
So is the MapReduce implementation of Regression written in Mahout-Samsara? I have written stable MapReduce codes for Regression, that is why I was interested. Thank You Punit Naik On Sun, May 31, 2015 at 9:22 PM, Scott Lett, PhD sl...@holisticmath.com wrote: Ted, Where is the first place to

Updated AMI for EMR

2015-06-01 Thread Andrew Musselman
AWS will be releasing a new AMI in July that will include our 0.10.1 release.

Re: Updated AMI for EMR

2015-06-01 Thread Suneel Marthi
Highly likely that there will be another 0.10.x out by July, will they be pulling off the latest ? On Mon, Jun 1, 2015 at 2:18 PM, Andrew Musselman andrew.mussel...@gmail.com wrote: AWS will be releasing a new AMI in July that will include our 0.10.1 release.

Re: Regression using MapReduce

2015-06-01 Thread Pat Ferrel
There is a menu full of references starting here: http://mahout.apache.org/users/sparkbindings/home.html Mahout-Samsara is a Linear Algebra DSL written as an extension of Scala that includes most of the R-Like primitives you need. As opposed to MLlib, which has a good number of algorithms but

Re: Streaming K-means

2015-06-01 Thread Ted Dunning
The streaming k-means works by building a sketch of the data which is then used to do real clustering. It might be that this sketch would be acceptable to do k-medoids, but that is definitely not guaranteed. Similarly, it might be possible to build a medoid sketch instead of a mean based sketch,