Re: [scikit-learn] Analysis of sklearn and other python libraries on github by MS team

2020-03-30 Thread Andreas Mueller
Also see https://github.com/scikit-learn/scikit-learn/issues/14268 which is discussing how to make things faster *and* more stable! On 3/30/20 10:30 AM, Andreas Mueller wrote: On 3/27/20 6:20 PM, Gael Varoquaux wrote: Thanks for the link Andy. This is indeed very interesting! On Fri, Mar 27

Re: [scikit-learn] Analysis of sklearn and other python libraries on github by MS team

2020-03-30 Thread Andreas Mueller
On 3/27/20 6:20 PM, Gael Varoquaux wrote: Thanks for the link Andy. This is indeed very interesting! On Fri, Mar 27, 2020 at 06:10:28PM +0100, Roman Yurchak wrote: Regarding learners, Top-5 in both GH17 and GH19 are LogisticRegression, MultinomialNB, SVC, LinearRegression, and RandomForestCl

Re: [scikit-learn] Analysis of sklearn and other python libraries on github by MS team

2020-03-28 Thread Gael Varoquaux
Thanks for the link Andy. This is indeed very interesting! On Fri, Mar 27, 2020 at 06:10:28PM +0100, Roman Yurchak wrote: > > Regarding learners, Top-5 in both GH17 and GH19 are LogisticRegression, > > MultinomialNB, SVC, LinearRegression, and RandomForestClassifier (in this > > order). > Maybe L

Re: [scikit-learn] Analysis of sklearn and other python libraries on github by MS team

2020-03-27 Thread Roman Yurchak
Very interesting! A few comments, > From GH17, we managed to extract only 10.5k pipelines. The relatively low frequency (with respect to the number of notebooks using SCIKIT-LEARN [..]) indicates a non-wide adoption of this specification. However, the number of pipelines in the GH19 corpus is

[scikit-learn] Analysis of sklearn and other python libraries on github by MS team

2020-03-27 Thread Andreas Mueller
Hey all. There's a pretty cool paper by a team at MS that analyses public github repos for their use of the sklearn and related libraries: https://arxiv.org/abs/1912.09536 Thought it might be of interest. Cheers, Andy ___ scikit-learn mailing list sc