Xiangrui Meng created SPARK-10354:
-------------------------------------

             Summary: First cost RDD shouldn't be cached in k-means|| and the following cost RDD should use MEMORY_AND_DISK
                 Key: SPARK-10354
                 URL: https://issues.apache.org/jira/browse/SPARK-10354
             Project: Spark
          Issue Type: Improvement
          Components: MLlib
    Affects Versions: 1.5.0
            Reporter: Xiangrui Meng
            Assignee: Xiangrui Meng
            Priority: Minor

In k-means|| initialization, the first cost RDD doesn't need to be cached. The cost RDDs that follow should be persisted with MEMORY_AND_DISK, so that partitions that do not fit in memory are spilled to disk instead of being recomputed.
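
The caching pattern described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual MLlib change: the names CostCachingSketch, sqDist, finalCosts and centerRounds are hypothetical, and the centers for each round are passed in directly, whereas the real k-means|| initialization samples new centers from the current costs between rounds.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD
import org.apache.spark.storage.StorageLevel

// Hypothetical sketch; not the MLlib implementation.
object CostCachingSketch {

  // Squared Euclidean distance between two points of equal dimension.
  def sqDist(a: Array[Double], b: Array[Double]): Double = {
    var sum = 0.0
    var i = 0
    while (i < a.length) {
      val d = a(i) - b(i)
      sum += d * d
      i += 1
    }
    sum
  }

  // Assumption: each element of centerRounds is a non-empty set of centers
  // added in one round (the real algorithm samples them between rounds).
  def finalCosts(
      data: RDD[Array[Double]],
      centerRounds: Seq[Seq[Array[Double]]]): RDD[Double] = {
    // First cost RDD: a constant map over the data, so it is left uncached.
    var costs: RDD[Double] = data.map(_ => Double.PositiveInfinity)
    var previousIsPersisted = false

    for (newCenters <- centerRounds) {
      val bcCenters = data.sparkContext.broadcast(newCenters)
      val preCosts = costs
      // Following cost RDDs: persist with MEMORY_AND_DISK so partitions that
      // do not fit in memory go to disk rather than being recomputed.
      costs = data.zip(preCosts).map { case (point, cost) =>
        math.min(cost, bcCenters.value.map(sqDist(point, _)).min)
      }.persist(StorageLevel.MEMORY_AND_DISK)
      costs.count() // materialize before releasing the previous cost RDD
      if (previousIsPersisted) preCosts.unpersist(blocking = false)
      previousIsPersisted = true
    }
    costs
  }

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setMaster("local[2]").setAppName("cost-caching-sketch")
    val sc = new SparkContext(conf)
    try {
      val data = sc.parallelize(
        Seq(Array(0.0, 0.0), Array(1.0, 0.0), Array(10.0, 0.0)), 2)
      val rounds = Seq(Seq(Array(0.0, 0.0)), Seq(Array(10.0, 0.0)))
      println(finalCosts(data, rounds).sum())
    } finally {
      sc.stop()
    }
  }
}
```

The count() call forces each new cost RDD to be computed while the previous one is still available, after which the previous one can be released.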