Hi list,

Based on a couple of recent remarks on issues, it seems that we need more text-mining examples.
I just had a crazy idea: we could use our own documentation as a dataset. We have 35k lines in rst files, and 120k words. That's a fairly decent corpus.

I don't know anything about text processing. Is anybody interested in cooking up a few examples using scikit-learn to do fancy stuff on the documentation? Things like topic modeling would be great :)

Gaël

_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
