Re: Difference in R and Spark Output

Felix Cheung Fri, 30 Dec 2016 21:07:31 -0800

Could you elaborate more on the huge difference you are seeing?

________________________________
From: Saroj C <saro...@tcs.com>
Sent: Friday, December 30, 2016 5:12:04 AM
To: User
Subject: Difference in R and Spark Output

Dear All,
 For the attached input file, there is a huge difference between the Clusters 
in R and Spark(ML). Any idea, what could be the difference ?

Note we wanted to create Five(5) clusters.

Please find the snippets in Spark and R

Spark

//Load the Data file

// Create K means Cluster
        KMeans kmeans = new KMeans().setK(5).setMaxIter(500)

.setFeaturesCol("features").setPredictionCol("prediction");

In R

//Load the Data File into df

//Create the K Means Cluster

model <- kmeans(df, 5)

Thanks & Regards
Saroj

=====-----=====-----=====
Notice: The information contained in this e-mail
message and/or attachments to it may contain
confidential or privileged information. If you are
not the intended recipient, any dissemination, use,
review, distribution, printing or copying of the
information contained in this e-mail message
and/or attachments to it are strictly prohibited. If
you have received this communication in error,
please notify us by reply e-mail or telephone and
immediately and permanently delete the message
and any attachments. Thank you

Re: Difference in R and Spark Output

Reply via email to