DistributedLDAModel missing APIs in org.apache.spark.ml

2016-03-19 Thread cindymc
I like using the new DataFrame APIs in Spark ML, compared to using RDDs in the older Spark MLlib. But it seems some of the older APIs are missing. In particular, '*.mllib.clustering.DistributedLDAModel' had two APIs that I need now:

topDocumentsPerTopic
topTopicsPerDocument

How can I get at the

Re: DistributedLDAModel missing APIs in org.apache.spark.ml

2016-03-18 Thread Ted Yu
Can you utilize this function of DistributedLDAModel?

  override protected def getModel: OldLDAModel = oldDistributedModel

cheers

On Fri, Mar 18, 2016 at 7:34 AM, cindymc wrote:
> I like using the new DataFrame APIs on Spark ML, compared to using RDDs in
> the
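
For reference, here is a minimal sketch of the two methods named in the question, called on the older RDD-based model. It does not go through getModel: that accessor is declared protected in the snippet above, so ordinary application code cannot call it. Instead the sketch trains with org.apache.spark.mllib directly. The toy corpus, the object name LdaTopDocsSketch, the local master setting, and the parameter values are all assumptions made for illustration, and the code assumes a Spark 1.5+ classpath.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.mllib.clustering.{DistributedLDAModel, LDA}
import org.apache.spark.mllib.linalg.{Vector, Vectors}
import org.apache.spark.rdd.RDD

// Hypothetical driver object; the name is illustrative only.
object LdaTopDocsSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("lda-sketch").setMaster("local[2]")
    val sc = new SparkContext(conf)

    // Toy corpus (assumed data): (document id, term-count vector)
    // over a 4-term vocabulary.
    val corpus: RDD[(Long, Vector)] = sc.parallelize(Seq(
      (0L, Vectors.dense(3.0, 1.0, 0.0, 0.0)),
      (1L, Vectors.dense(2.0, 2.0, 0.0, 1.0)),
      (2L, Vectors.dense(0.0, 0.0, 4.0, 2.0)),
      (3L, Vectors.dense(0.0, 1.0, 2.0, 3.0))
    ))

    // With the default EM optimizer, run() returns a DistributedLDAModel,
    // so the cast below is expected to succeed for this configuration.
    val model = new LDA()
      .setK(2)
      .setMaxIterations(20)
      .run(corpus)
      .asInstanceOf[DistributedLDAModel]

    // For each topic: the ids of the highest-weighted documents and their weights.
    val topDocs: Array[(Array[Long], Array[Double])] =
      model.topDocumentsPerTopic(2)
    topDocs.zipWithIndex.foreach { case ((docIds, weights), topic) =>
      println(s"topic $topic: " + docIds.zip(weights).mkString(", "))
    }

    // For each document: the indices of its highest-weighted topics and their weights.
    val topTopics: RDD[(Long, Array[Int], Array[Double])] =
      model.topTopicsPerDocument(1)
    topTopics.collect().foreach { case (docId, topics, weights) =>
      println(s"doc $docId: " + topics.zip(weights).mkString(", "))
    }

    sc.stop()
  }
}
```

The trade-off is that this path takes an RDD of (id, vector) pairs rather than a DataFrame, so data prepared for the ML pipeline would first need to be converted to that shape.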