Hi list,

Based on a couple of remarks on issues recently, it seems that we need
more text-mining examples.

I just had a crazy idea: we could use as a dataset our own documentation.
We have 35k lines in rst files, and 120k words. That's somewhat a decent
corpus.

I don't know anything about text processing. Anybody interested in
cooking up a few examples using scikit-learn to do fancy stuff on the
documentation? Things like topic modeling would be great :)

Gaƫl

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_jan
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to