[ https://issues.apache.org/jira/browse/SPARK-16786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joseph K. Bradley closed SPARK-16786. ------------------------------------- Resolution: Won't Fix > LDA topic distributions for new documents in PySpark > ---------------------------------------------------- > > Key: SPARK-16786 > URL: https://issues.apache.org/jira/browse/SPARK-16786 > Project: Spark > Issue Type: Improvement > Components: MLlib, PySpark > Affects Versions: 2.0.0 > Environment: N/A > Reporter: Jordan Beauchamp > Priority: Minor > Labels: patch > Original Estimate: 24h > Remaining Estimate: 24h > > pyspark.mllib.clustering.LDAModel has no way to estimate the topic > distribution for new documents. However, this functionality exists in > org.apache.spark.mllib.clustering.LDAModel. This change would only require > setting up the API calls. I have forked the spark repo and implemented the > changes locally -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org