Could you elaborate more on the huge difference you are seeing?
________________________________ From: Saroj C <saro...@tcs.com> Sent: Friday, December 30, 2016 5:12:04 AM To: User Subject: Difference in R and Spark Output Dear All, For the attached input file, there is a huge difference between the Clusters in R and Spark(ML). Any idea, what could be the difference ? Note we wanted to create Five(5) clusters. Please find the snippets in Spark and R Spark //Load the Data file // Create K means Cluster KMeans kmeans = new KMeans().setK(5).setMaxIter(500) .setFeaturesCol("features").setPredictionCol("prediction"); In R //Load the Data File into df //Create the K Means Cluster model <- kmeans(df, 5) Thanks & Regards Saroj =====-----=====-----===== Notice: The information contained in this e-mail message and/or attachments to it may contain confidential or privileged information. If you are not the intended recipient, any dissemination, use, review, distribution, printing or copying of the information contained in this e-mail message and/or attachments to it are strictly prohibited. If you have received this communication in error, please notify us by reply e-mail or telephone and immediately and permanently delete the message and any attachments. Thank you