I'm not exactly sure how you would like to setup your LDA model, but I noticed there was no Python example for LDA in Spark. I created this issue to add it https://issues.apache.org/jira/browse/SPARK-13500. Keep an eye on this if it could be of help.
bryan On Wed, Feb 24, 2016 at 8:34 PM, Mishra, Abhishek <abhishek.mis...@xerox.com > wrote: > Hello All, > > > > If someone has any leads on this please help me. > > > > Sincerely, > > Abhishek > > > > *From:* Mishra, Abhishek > *Sent:* Wednesday, February 24, 2016 5:11 PM > *To:* user@spark.apache.org > *Subject:* LDA topic Modeling spark + python > > > > Hello All, > > > > > > I am doing a LDA model, please guide me with something. > > > > I have a csv file which has two column "user_id" and "status". I have to > generate a word-topic distribution after aggregating the user_id. Meaning > to say I need to model it for users on their grouped status. The topic > length being 2000 and value of k or number of words being 3. > > > > Please, if you can provide me with some link or some code base on spark > with python ; I would be grateful. > > > > > > Looking forward for a reply, > > > > Sincerely, > > Abhishek > > >