Dear Yang Li,
> Neither the classificationTree nor the regressionTree supports
> categorical feature. That means the Decision trees model can only accept
> continuous feature.
Consider either manually encoding your categories in bitstrings (e.g.,
"Facebook" = 001, "Twitter" = 010, "Google" = 100), or
Hi, I'm a graduate student using sklearn for some data work.
While handling data with the Decision Trees library, I found an
inconvenience:
Neither the classificationTree nor the regressionTree supports categorical
features. That means the decision tree model can only
I've read about Dask, and it is a tool I want to have in my belt, especially
for using the SGE connection to run GridSearchCV at the
supercomputer center I have access to. Should it work as promised, it will
be one of my favs.
As for my toy example, I keep more limited goals with this grap
Yes, it is an efficient method. Still, we need to specify the number of
clusters or the threshold. Is there another way to run hierarchical clustering
on a big dataset? The main problem is the distance matrix.
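To make the distance-matrix concern concrete, here is a rough back-of-the-envelope sketch. It assumes distances are stored as 8-byte floats in condensed form (each pair once); the function name `condensed_distance_bytes` is hypothetical.

```python
def condensed_distance_bytes(n_samples, bytes_per_value=8):
    """Bytes needed to store every pairwise distance exactly once."""
    # n * (n - 1) / 2 unordered pairs; assumed 8 bytes (float64) per distance.
    n_pairs = n_samples * (n_samples - 1) // 2
    return n_pairs * bytes_per_value


for n in (10000, 100000, 1000000):
    gib = condensed_distance_bytes(n) / 2**30
    print("{:>9,d} samples -> {:,.1f} GiB".format(n, gib))
```

Storage grows quadratically with the number of samples, so ten times more samples needs about a hundred times more memory.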
Thanks.
On Tue, Jan 2, 2018 at 6:02 AM, Olivier Grisel
wrote:
> Have you had a look at
Howdy all!
I'm pleased to announce the release of pomegranate v0.9.0. The focus of
this release is on missing value support across all model fitting /
structure learning / inference methods and models. This enables you to do
everything from fitting a multivariate Gaussian distribution to an
incom
I agree with Gael on this one and am happy to help with the PR if you need
any assistance.
Best,
Maciek
Pozdrawiam, | Best regards,
Maciek Wójcikowski
mac...@wojcikowski.pl
2017-12-29 18:14 GMT+01:00 Gael Varoquaux:
> I think that a transform method would be good. We would have to add