Joseph K. Bradley created SPARK-9246: ----------------------------------------
Summary: DistributedLDAModel predict top docs per topic Key: SPARK-9246 URL: https://issues.apache.org/jira/browse/SPARK-9246 Project: Spark Issue Type: New Feature Components: MLlib Reporter: Joseph K. Bradley For each topic, return top documents based on topicDistributions. Synopsis: {code} /** * @param maxDocuments Max docs to return for each topic * @return Array over topics of (sorted top docs, corresponding doc-topic weights) */ def topDocumentsPerTopic(maxDocuments: Int): Array[(Array[Long], Array[Double])] {code} Note: We will need to make sure that the above return value format is Java-friendly. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org