I just implemented Latent Dirichlet Allocation with collapsed Gibbs sampling and made a demo on 20 Newsgroups. If there's interest in having this in sklearn, I could clean up the code for contribution.
I noticed there was same discussion back in January about PyMC that didn't reach an actionable conclusion (unless the list archives broke the thread up). It's not terribly apropos for this code (which is based on analytically integrating out much of what PyMC would have to keep track of), but feel free to use this as an excuse to finish that discussion :) -Ken ------------------------------------------------------------------------------ The demand for IT networking professionals continues to grow, and the demand for specialized networking skills is growing even more rapidly. Take a complimentary Learning@Cisco Self-Assessment and learn about Cisco certifications, training, and career opportunities. http://p.sf.net/sfu/cisco-dev2dev _______________________________________________ Scikit-learn-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
