[
https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-10329:
--
Assignee: hujiayin
Cost RDD in k-means|| initialization is not storage-efficient
[
https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-10329:
--
Description:
Currently we use `RDD[Vector]` to store point cost during k-means||
[
https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-10329:
--
Labels: clustering (was: )
Cost RDD in k-means initialization is not storage-efficient
[
https://issues.apache.org/jira/browse/SPARK-10329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiangrui Meng updated SPARK-10329:
--
Summary: Cost RDD in k-means|| initialization is not storage-efficient
(was: Cost RDD in