[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA

2015-05-10 Thread yuhao yang (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537482#comment-14537482
 ] 

yuhao yang commented on SPARK-7496:
---

Thanks Joseph. PR sent.

> Update Programming guide with Online LDA
> 
>
> Key: SPARK-7496
> URL: https://issues.apache.org/jira/browse/SPARK-7496
> Project: Spark
>  Issue Type: Documentation
>  Components: Documentation, MLlib
>Reporter: Joseph K. Bradley
>Priority: Minor
>
> Update LDA subsection of clustering section of MLlib programming guide to 
> include OnlineLDA



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA

2015-05-10 Thread Apache Spark (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537480#comment-14537480
 ] 

Apache Spark commented on SPARK-7496:
-

User 'hhbyyh' has created a pull request for this issue:
https://github.com/apache/spark/pull/6046

> Update Programming guide with Online LDA
> 
>
> Key: SPARK-7496
> URL: https://issues.apache.org/jira/browse/SPARK-7496
> Project: Spark
>  Issue Type: Documentation
>  Components: Documentation, MLlib
>Reporter: Joseph K. Bradley
>Priority: Minor
>
> Update LDA subsection of clustering section of MLlib programming guide to 
> include OnlineLDA



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA

2015-05-10 Thread Joseph K. Bradley (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537316#comment-14537316
 ] 

Joseph K. Bradley commented on SPARK-7496:
--

"LDA takes in a collection of documents as vectors of word counts." sounds like 
it duplicates what's already there.  The 2nd part sounds good.  Would you mind 
taking this JIRA and sending a PR? Thanks!

> Update Programming guide with Online LDA
> 
>
> Key: SPARK-7496
> URL: https://issues.apache.org/jira/browse/SPARK-7496
> Project: Spark
>  Issue Type: Documentation
>  Components: Documentation, MLlib
>Reporter: Joseph K. Bradley
>Priority: Minor
>
> Update LDA subsection of clustering section of MLlib programming guide to 
> include OnlineLDA



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-7496) Update Programming guide with Online LDA

2015-05-10 Thread yuhao yang (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537114#comment-14537114
 ] 

yuhao yang commented on SPARK-7496:
---

Hi Joseph, just something I got for your reference,

LDA takes in a collection of documents as vectors of word counts. It supports 
different inference algorithms via setOptimizer function. EMLDAOptimizer learns 
clustering using expectation-maximization on the likelihood function, while 
OnlineLDAOptimizer uses iterative mini-batch sampling for online variational 
inference,

After fitting on the documents, LDA provides:

> Update Programming guide with Online LDA
> 
>
> Key: SPARK-7496
> URL: https://issues.apache.org/jira/browse/SPARK-7496
> Project: Spark
>  Issue Type: Documentation
>  Components: Documentation, MLlib
>Reporter: Joseph K. Bradley
>Priority: Minor
>
> Update LDA subsection of clustering section of MLlib programming guide to 
> include OnlineLDA



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org