[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA
[ https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537482#comment-14537482 ]

yuhao yang commented on SPARK-7496:
-----------------------------------

Thanks Joseph. PR sent.

> Update Programming guide with Online LDA
>
>         Key: SPARK-7496
>         URL: https://issues.apache.org/jira/browse/SPARK-7496
>     Project: Spark
>  Issue Type: Documentation
>  Components: Documentation, MLlib
>    Reporter: Joseph K. Bradley
>    Priority: Minor
>
> Update LDA subsection of clustering section of MLlib programming guide to
> include OnlineLDA

--
This message was sent by Atlassian JIRA (v6.3.4#6332)

To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA
[ https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537480#comment-14537480 ]

Apache Spark commented on SPARK-7496:
-------------------------------------

User 'hhbyyh' has created a pull request for this issue:
https://github.com/apache/spark/pull/6046
[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA
[ https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537316#comment-14537316 ]

Joseph K. Bradley commented on SPARK-7496:
------------------------------------------

"LDA takes in a collection of documents as vectors of word counts." sounds like it duplicates what's already there. The 2nd part sounds good. Would you mind taking this JIRA and sending a PR? Thanks!
[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA
[ https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537114#comment-14537114 ]

yuhao yang commented on SPARK-7496:
-----------------------------------

Hi Joseph, just something I got for your reference:

LDA takes in a collection of documents as vectors of word counts. It supports different inference algorithms via the setOptimizer function. EMLDAOptimizer learns clustering using expectation-maximization on the likelihood function, while OnlineLDAOptimizer uses iterative mini-batch sampling for online variational inference. After fitting on the documents, LDA provides:
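
For readers unfamiliar with the input format named in the proposed wording ("documents as vectors of word counts"), here is a minimal sketch in plain Python. It is not taken from the thread or from the pull request. The helper names `build_vocabulary` and `to_count_vector` and the two sample documents are hypothetical, and the sketch only builds the count vectors. It does not call Spark or setOptimizer. In MLlib, each such vector would be paired with a document id before being handed to LDA.

```python
from collections import Counter


def build_vocabulary(tokenized_docs):
    """Map each distinct term to a column index (sorted for determinism)."""
    terms = sorted({term for doc in tokenized_docs for term in doc})
    return {term: index for index, term in enumerate(terms)}


def to_count_vector(tokens, vocabulary):
    """Return a dense list whose i-th entry is the count of vocabulary term i."""
    vector = [0] * len(vocabulary)
    for term, count in Counter(tokens).items():
        if term in vocabulary:  # terms outside the vocabulary are ignored
            vector[vocabulary[term]] = count
    return vector


# Hypothetical two-document corpus, already tokenized on whitespace.
docs = [
    "spark runs lda on spark clusters".split(),
    "online lda processes mini batches".split(),
]

vocab = build_vocabulary(docs)

# One (document id, word-count vector) pair per document.
corpus = [
    (doc_id, to_count_vector(tokens, vocab))
    for doc_id, tokens in enumerate(docs)
]

for doc_id, vector in corpus:
    print(doc_id, vector)
```

Each vector has one entry per vocabulary term, so every document in the corpus shares the same dimensionality regardless of its length.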