[jira] [Commented] (SPARK-21244) KMeans applied to processed text day clumps almost all documents into one cluster

2017-07-01 Thread Nassir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16071447#comment-16071447 ] Nassir commented on SPARK-21244: Hi, The pyspark k-means implementation is on the same 20 newsgroup

[jira] [Commented] (SPARK-21244) KMeans applied to processed text day clumps almost all documents into one cluster

2017-06-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16066833#comment-16066833 ] Sean Owen commented on SPARK-21244: There's no detail here that suggests a Spark bug. Depending on