I haven't given any thought to streaming it, but in case it's useful I do have a k-medoids implementation for Spark: http://silex.freevariable.com/latest/api/#com.redhat.et.silex.cluster.KMedoids
Also a blog post about multi-threading it: http://erikerlandson.github.io/blog/2015/05/06/parallel-k-medoids-using-scala-parseq/ ----- Original Message ----- > Hello everyone, > > I have an idea and I would like to get a validation from community about > it. > > In Mahout there is an implementation of Streaming K-means. I'm > interested in your opinion would it make sense to make a similar > implementation of Streaming K-medoids? > > K-medoids has even bigger problems than K-means because it's not > scalable, but can be useful in some cases (e.g. It allows more > sophisticated distance measures). > > What is your opinion about such an approach? Does anyone see problems > with it? > > Best regards, > Marko > > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org