Spark K means number of Iterations?

2015-09-12 Thread ashensw
H all, I want to know whether the K means algorithm stops if the data set converges to stable clusters before reaching the number of iteration that we defined? As an example if I give 100 as the number of iterations and 5 as the number of clusters, but if the data set converges to stable 5

Re: Spark K means number of Iterations?

2015-09-12 Thread ashensw
Okay. Thanks for the help. On Sat, Sep 12, 2015 at 1:16 PM, Robineast [via Apache Spark User List] < ml-node+s1001560n24665...@n3.nabble.com> wrote: > Yes it does stop if the algorithm converges in less than the specified > tolerance. You have a parameter to the Kmeans constructor called epsilon

Distance Calculation in Spark K means clustering

2015-08-31 Thread ashensw
Hi all, I am currently working on some K means clustering project. I want to get the distances of each data point to it's cluster center after building the K means model. Currently I get the cluster centers of each data point by sending the JavaRDD which includes all the data points to K means

Calculating Min and Max Values using Spark Transformations?

2015-08-28 Thread ashensw
Hi all, I have a dataset which consist of large number of features(columns). It is in csv format. So I loaded it into a spark dataframe. Then I converted it into a JavaRDDRow Then using a spark transformation I converted that into JavaRDDString[]. Then again converted it into a JavaRDDdouble[].

How to get the radius of clusters in spark K means

2015-08-20 Thread ashensw
We can get cluster centers in K means clustering. Like wise is there any method in spark to get the cluster radius? -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/How-to-get-the-radius-of-clusters-in-spark-K-means-tp24353.html Sent from the Apache Spark