Weichen Xu created SPARK-16697: ---------------------------------- Summary: redundant RDD computation in LDAOptimizer Key: SPARK-16697 URL: https://issues.apache.org/jira/browse/SPARK-16697 Project: Spark Issue Type: Improvement Components: ML, MLlib Affects Versions: 2.0.1, 2.1.0 Reporter: Weichen Xu
In mllib.clustering.LDAOptimizer the submitMiniBatch method, the stats: RDD do not persist but the following code will use it twice. so it cause redundant computation on it. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org