[ https://issues.apache.org/jira/browse/SPARK-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15176747#comment-15176747 ]
Joseph K. Bradley commented on SPARK-13161: ------------------------------------------- There are many generalizations of LDA, so it would be valuable to know about people's use cases and needs. Do you have a use case you could describe for this? It would be great to have this feature as a Spark package in the meantime. > Extend MLlib LDA to include options for Author Topic Modeling > ------------------------------------------------------------- > > Key: SPARK-13161 > URL: https://issues.apache.org/jira/browse/SPARK-13161 > Project: Spark > Issue Type: Improvement > Components: MLlib > Affects Versions: 1.6.0 > Reporter: John Hogue > > The author-topic model, a generative model for documents that extends Latent > Dirichlet Allocation. > By modeling the interests of authors, we can answer a range of important > queries about the content of document collections. With an appropriate author > model, we can establish which subjects an author writes about, which authors > are likely to have written documents similar to an observed document, and > which authors produce similar work. > Full whitepaper here. > http://mimno.infosci.cornell.edu/info6150/readings/398.pdf -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org