Re: [scikit-learn] April 27th scikit-learn monthly meeting

2020-04-24 Thread Hermes Morales
Thank you Chiara
Which is the usual time?

Obtener Outlook para Android


From: scikit-learn  
on behalf of Chiara Marmo 
Sent: Friday, April 24, 2020 7:29:19 AM
To: Scikit-learn mailing list 
Subject: [scikit-learn] April 27th scikit-learn monthly meeting


Hi all,

The next scikit-learn monthly meeting will take place on Monday April 27th at 
the usual time: 
https://www.timeanddate.com/worldclock/meetingdetails.html?year=2020&month=4&day=27&hour=12&min=0&sec=0&p1=240&p2=33&p3=37&p4=179&p5=195

While these meetings are mainly for core-devs to discuss the current topics, 
we're also happy to welcome non-core devs and other projects maintainers! Feel 
free to join, using the following link:

https://anaconda.zoom.us/j/94399382811?pwd=cXBtQ2lTVEtVbFpVTkE3TVFxdEhqZz09

Meeting ID: 943 9938 2811
Password: 68473658


If you plan to attend and you would like to discuss something specific about 
your contribution please add your name (or github pseudo) in the "Issue and 
comments from 
contributors",
 of the public pad:

https://hackmd.io/5c6LxpnWSzeaBwJfuX5gPA


@core devs, please make sure to update your notes on Friday.


Best,

Chiara
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn


Re: [scikit-learn] Voting software

2020-04-27 Thread Hermes Morales
https://doodle.com/es/ is not bad

Obtener Outlook para Android


From: scikit-learn  
on behalf of Roman Yurchak 
Sent: Monday, April 27, 2020 10:30:49 AM
To: Scikit-learn user and developer mailing list 
Subject: Re: [scikit-learn] Voting software

BTW, could we use some online voting software for votes? Just to avoid
filling public email threads with +1s. For instance CPython uses
https://www.python.org/dev/peps/pep-8001/ but it is anonymous. Does
anyone know a simple non anonymous one preferably linked to Github
authentication?

On 27/04/2020 15:18, Nicolas Hug wrote:
> +1
>
> On 4/27/20 9:16 AM, Gael Varoquaux wrote:
>> +1
>>
>> And thank you very much Adrin!
>>
>> On Mon, Apr 27, 2020 at 09:12:02AM -0400, Andreas Mueller wrote:
>>> Hi All.
>>> Given all his recent contributions, I want to nominate Adrin Jalali
>>> to the
>>> Technical Committee:
>>> https://scikit-learn.org/stable/governance.html#technical-committee
>>> According to the governance document, this will require a discussion
>>> and
>>> vote.
>>> I think we can move to the vote immediately unless someone objects.
>>> Thanks for all your work Adrin!
>>> Cheers,
>>> Andy
>>> ___
>>> scikit-learn mailing list
>>> scikit-learn@python.org
>>> https://mail.python.org/mailman/listinfo/scikit-learn
> ___
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn


Re: [scikit-learn] Why does sklearn require one-hot-encoding for categorical features? Can we have a "factor" data type?

2020-04-30 Thread Hermes Morales
Perhaps pd.factorize could hello?

Obtener Outlook para Android


From: scikit-learn  
on behalf of Gael Varoquaux 
Sent: Thursday, April 30, 2020 5:12:06 PM
To: Scikit-learn mailing list 
Subject: Re: [scikit-learn] Why does sklearn require one-hot-encoding for 
categorical features? Can we have a "factor" data type?

On Thu, Apr 30, 2020 at 03:55:00PM -0400, C W wrote:
> I've used R and Stata software, none needs such transformation. They have a
> data type called "factors", which is different from "numeric".

> My problem with OHE:
> One-hot-encoding results in large number of features. This really blows up
> quickly. And I have to fight curse of dimensionality with PCA reduction. 
> That's
> not cool!

Most statistical models still not one-hot encoding behind the hood. So, R
and stata do it too.

Typically, tree-based models can be adapted to work directly on
categorical data. Ours don't. It's work in progress.

G
___
scikit-learn mailing list
scikit-learn@python.org
https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fmail.python.org%2Fmailman%2Flistinfo%2Fscikit-learn&data=02%7C01%7C%7Ce7aa6f99b7914a1f84b208d7ed430801%7C84df9e7fe9f640afb435%7C1%7C0%7C637238744453345410&sdata=e3BfHB4v5VFteeZ0Zh3FJ9Wcz9KmkUwur5i8Reue3mc%3D&reserved=0
___
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn