After reading the internal code of Spark about it, I wasn't able to understand why it calls takeSample() twice? Can someone please explain?
There is a relevant StackOverflow question <http://stackoverflow.com/questions/38986395/sparkkmeans-calls-takesample-twice> . -- View this message in context: http://apache-spark-developers-list.1001551.n3.nabble.com/KMeans-calls-takeSample-twice-tp18761.html Sent from the Apache Spark Developers List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe e-mail: dev-unsubscr...@spark.apache.org