[jira] [Updated] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-04 Thread Xi Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xi Shen updated SPARK-6706: --- Description: When doing k-means clustering with the kmeans|| algorithm, which is the default one. The algorithm

[jira] [Commented] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-04 Thread Xi Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395622#comment-14395622 ] Xi Shen commented on SPARK-6706: I know it is more like a user report than a technical

[jira] [Issue Comment Deleted] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-04 Thread Xi Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xi Shen updated SPARK-6706: --- Comment: was deleted (was: Yes, the {{collect()}} jobs finished, then hangs at the driver. Your words are

[jira] [Commented] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-04 Thread Xi Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395641#comment-14395641 ] Xi Shen commented on SPARK-6706: Yes, the {{collect()}} jobs finished, then hangs at the

[jira] [Commented] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-04 Thread Xi Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395642#comment-14395642 ] Xi Shen commented on SPARK-6706: Yes, the {{collect()}} jobs finished, then hangs at the

[jira] [Commented] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-03 Thread Xi Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14395457#comment-14395457 ] Xi Shen commented on SPARK-6706: The complete code and test data are available at

[jira] [Created] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-03 Thread Xi Shen (JIRA)
Xi Shen created SPARK-6706: -- Summary: kmeans|| hangs for a long time if both k and vector dimension are large Key: SPARK-6706 URL: https://issues.apache.org/jira/browse/SPARK-6706 Project: Spark

[jira] [Updated] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-03 Thread Xi Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xi Shen updated SPARK-6706: --- Description: When doing k-means clustering with the kmeans|| algorithm, which is the default one. The algorithm

[jira] [Updated] (SPARK-6706) kmeans|| hangs for a long time if both k and vector dimension are large

2015-04-03 Thread Xi Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xi Shen updated SPARK-6706: --- Attachment: kmeans-debug.7z This package contains the code and dummy data. The test data is about 100 MB. I