On 3 March 2012 00:31, Andreas <[email protected]> wrote:

> On 03/02/2012 02:33 PM, Olivier Grisel wrote:
> > 2012/3/2 Andreas<[email protected]>:
> >
> >> Hi everybody!
> >> As I am in charge of the next release, here a quick pep talk and
> >> wish list:
> >>
> >> We are two month into .10 now and I feel the spirit of Granada
> >> is still with us with lots of activity going on all the time.
> >> Loads of great work has happened since the last
> >> release, even though much was "behind the scenes" and fixes.
> >>
> >> There are some things that I would really love to have in
> >> 0.11 and that I think there is still enough time to do.
> >>
> >> I think one of the great things about sklearn is the documentation,
> >> but as was pointed out recently, there is still room for improvement.
> >>
> >> In particular, it really bugs me that there is no documentation for
> >> LDA and the text features.
> >> I will try to do LDA but I really can not do documentation for text
> >> features. So it would be *really* great if someone with some
> >> NLP knowledge could sit down and write some documentation
> >> for this part of the scikit.
> >>
> > I am currently tinkering with a full flattening of the
> > sklearn.feature_extraxtion.text module (no more preprocessor and
> > analyzers, just vectorizers + TfIdf transformer). Will finish and
> > submit an PR this WE along with the narrative doc.
> >
> > This will be much simpler that the current state and will make it
> > easier to implement the murmurhash3 hashing vectorizer afterwards.
> >
> >
> Awesome :) I don't have much insight into the module but
> it felt like it needed some love. Thumbs up!
>
>
> ------------------------------------------------------------------------------
> Virtualization & Cloud Management Using Capacity Planning
> Cloud computing makes use of virtualization - but cloud computing
> also focuses on allowing computing to be delivered as a service.
> http://www.accelacomm.com/jaw/sfnl/114/51521223/
> _______________________________________________
> Scikit-learn-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>


I'm looking forward to the text.py changes!
-- 

Public key at: http://pgp.mit.edu/ Search for this email address and select
the key from "2011-08-19" (key id: 54BA8735)
------------------------------------------------------------------------------
Virtualization & Cloud Management Using Capacity Planning
Cloud computing makes use of virtualization - but cloud computing 
also focuses on allowing computing to be delivered as a service.
http://www.accelacomm.com/jaw/sfnl/114/51521223/
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to