I haven't given any thought to streaming it, but in case it's useful, I do
have a k-medoids implementation for Spark:
http://silex.freevariable.com/latest/api/#com.redhat.et.silex.cluster.KMedoids

Also a blog post about multi-threading it:
http://erikerlandson.github.io/blog/2015/05/06/parallel-k-medoids-using-scala-parseq/
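
For anyone following the thread who hasn't worked with k-medoids, here is a
minimal single-machine sketch of the basic alternating (assign, then re-pick
medoid) variant. It is only an illustration under stated assumptions: it is
not the Silex API linked above, it is neither streaming nor distributed, and
the names k_medoids and levenshtein are hypothetical helpers made up for this
example. It shows the two points raised in the question: the algorithm only
ever calls a distance function, so any measure works (edit distance on
strings here), and the medoid update is quadratic in cluster size, which is
where the scalability trouble comes from.

```python
import random
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                # deletion
                            curr[j - 1] + 1,            # insertion
                            prev[j - 1] + (ca != cb)))  # substitution
        prev = curr
    return prev[-1]


def k_medoids(points: Sequence[T], k: int, dist: Callable[[T, T], float],
              max_iter: int = 100, seed: int = 0) -> List[T]:
    """Naive alternating k-medoids (hypothetical helper, illustration only).

    Needs only a pairwise distance function, never a mean or a vector space.
    """
    rng = random.Random(seed)
    medoids = rng.sample(list(points), k)  # random initial medoids
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest medoid's cluster.
        clusters: List[List[T]] = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: dist(p, medoids[i]))
            clusters[nearest].append(p)
        # Update step: the new medoid is the member with the smallest total
        # distance to the rest of its cluster. This is O(n^2) per cluster.
        new_medoids = []
        for j, members in enumerate(clusters):
            if not members:  # possible when the input has duplicate points
                new_medoids.append(medoids[j])
            else:
                new_medoids.append(
                    min(members, key=lambda c: sum(dist(c, q) for q in members)))
        if new_medoids == medoids:  # converged: medoids did not change
            break
        medoids = new_medoids
    return medoids


if __name__ == "__main__":
    words = ["kitten", "sitten", "mitten", "bottle", "battle", "cattle"]
    # The result depends on the random initialization, so no output is
    # claimed here; every returned medoid is one of the input words.
    print(k_medoids(words, 2, levenshtein))
```

Because the medoids are always actual input elements, the same function works
unchanged on numbers with dist = lambda a, b: abs(a - b), or on any other
objects for which a pairwise distance can be computed.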



----- Original Message -----
> Hello everyone,
> 
> I have an idea and I would like to get validation from the community
> on it.
> 
> In Mahout there is an implementation of Streaming K-means. I'm
> interested in your opinion: would it make sense to make a similar
> implementation of Streaming K-medoids?
> 
> K-medoids has even bigger problems than K-means because it's not
> scalable, but it can be useful in some cases (e.g., it allows more
> sophisticated distance measures).
> 
> What is your opinion about such an approach? Does anyone see problems
> with it?
> 
> Best regards,
> Marko
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> For additional commands, e-mail: user-h...@spark.apache.org
> 
> 
