[
https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xi Shen updated SPARK-6706:
---
Description:
When doing k-means cluster with the kmeans|| algorithm which is the default
one. The algorithm
[
https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14395622#comment-14395622
]
Xi Shen commented on SPARK-6706:
I know it is more like a user report than a technical
[
https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xi Shen updated SPARK-6706:
---
Comment: was deleted
(was: Yes, the {{collect()}} jobs finished, then hangs at the driver. Your
words are
[
https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14395641#comment-14395641
]
Xi Shen commented on SPARK-6706:
Yes, the {{collect()}} jobs finished, then hangs at the
[
https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14395642#comment-14395642
]
Xi Shen commented on SPARK-6706:
Yes, the {{collect()}} jobs finished, then hangs at the
[
https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14395457#comment-14395457
]
Xi Shen commented on SPARK-6706:
The complete code and test data is available at
Xi Shen created SPARK-6706:
--
Summary: kmeans|| hangs for a long time if both k and vector
dimension are large
Key: SPARK-6706
URL: https://issues.apache.org/jira/browse/SPARK-6706
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xi Shen updated SPARK-6706:
---
Description:
When doing k-means cluster with the kmeans|| algorithm which is the default
one. The algorithm
[
https://issues.apache.org/jira/browse/SPARK-6706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xi Shen updated SPARK-6706:
---
Attachment: kmeans-debug.7z
This package contains the code and dummy data. The test data is about 100 MB. I