Re: [scikit-learn] A necessary feature for Decision trees

2018-01-03 Thread Brown J.B. via scikit-learn
Dear Yang Li, > Neither the classificationTree nor the regressionTree supports categorical features. That means the Decision trees model can only accept continuous features. Consider either manually encoding your categories as bitstrings (e.g., "Facebook" = 001, "Twitter" = 010, "Google" = 100), or
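
The bitstring encoding suggested in this reply can be sketched roughly as below. This is a minimal pure-Python illustration, not scikit-learn's own API: the helper name `one_hot_encode` and the category ordering are assumptions chosen so that the output matches the "Facebook" = 001, "Twitter" = 010, "Google" = 100 example.

```python
# Hypothetical helper (not part of scikit-learn): turn each category
# label into a list of 0/1 flags with exactly one flag set.
def one_hot_encode(values, categories):
    # Position of each category determines which bit is set.
    index = {cat: i for i, cat in enumerate(categories)}
    width = len(categories)
    encoded = []
    for value in values:
        row = [0] * width
        row[index[value]] = 1  # raises KeyError for an unseen category
        encoded.append(row)
    return encoded


# Assumed ordering, picked so that "Google" maps to 100 and "Facebook" to 001.
categories = ["Google", "Twitter", "Facebook"]
samples = ["Facebook", "Twitter", "Google"]

for label, bits in zip(samples, one_hot_encode(samples, categories)):
    print(label, "".join(str(b) for b in bits))
```

Each resulting row is a plain numeric vector, so it could be stacked next to the continuous columns before fitting a tree.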

[scikit-learn] A necessary feature for Decision trees

2018-01-03 Thread 李扬
Hi, I'm a graduate student using sklearn for some data work. While handling data with the Decision Trees library, I found an inconvenience: neither the classificationTree nor the regressionTree supports categorical features. That means the Decision trees model can only

Re: [scikit-learn] Any plans on generalizing Pipeline and transformers?

2018-01-03 Thread Manuel Castejón Limas
I've read about Dask, and it is a tool I want to have in my belt, especially for using the SGE connection to run GridSearchCV on the supercomputer center I have access to. Should it work as promised, it will be one of my favs. As for my toy example, I keep more limited goals with this grap

Re: [scikit-learn] clustering on big dataset

2018-01-03 Thread Shiheng Duan
Yes, it is an efficient method; still, we need to specify the number of clusters or the threshold. Is there another way to run hierarchical clustering on a big dataset? The main problem is the distance matrix. Thanks. On Tue, Jan 2, 2018 at 6:02 AM, Olivier Grisel wrote: > Have you had a look at

[scikit-learn] pomegranate v0.9.0 released: probabilistic modeling for Python

2018-01-03 Thread Jacob Schreiber
Howdy all! I'm pleased to announce the release of pomegranate v0.9.0. The focus of this release is on missing value support across all model fitting / structure learning / inference methods and models. This enables you to do everything from fitting a multivariate Gaussian distribution to an incom

Re: [scikit-learn] MLPClassifier as a feature selector

2018-01-03 Thread Maciek Wójcikowski
I agree with Gael on this one and am happy to help with the PR if you need any assistance. Best, Maciek. Best regards, Maciek Wójcikowski mac...@wojcikowski.pl 2017-12-29 18:14 GMT+01:00 Gael Varoquaux: > I think that a transform method would be good. We would have to add