Very interesting! A few comments,
> From GH17, we managed to extract only 10.5k pipelines. The
relatively low frequency (with respect to the number of notebooks using
SCIKIT-LEARN [..]) indicates a non-wide adoption of this specification.
However, the number of pipelines in the GH19 corpus is
There's an interesting analysis in this paper:
Fast K-Means with Accurate Bounds
http://proceedings.mlr.press/v48/newling16.pdf
On 3/26/20 3:40 AM, Alexandre Gramfort wrote:
hi,
I suspect Elkan is really winning when you have many centroids
so the conclusion is not systematic
my 2c
Alex
On
Hey all.
There's a pretty cool paper by a team at MS that analyses public github
repos for their use of the sklearn and related libraries:
https://arxiv.org/abs/1912.09536
Thought it might be of interest.
Cheers,
Andy
___
scikit-learn mailing list
sc