Re: [scikit-learn] Sprint discussion points?

Andreas Mueller Thu, 14 Feb 2019 05:29:00 -0800


On 2/13/19 11:28 PM, Joel Nothman wrote:

Convergence in logistic regression (https://github.com/scikit-learn/scikit-learn/issues/11536) is indeed one problem (and it presents a general issue of what max_iter means when you have several solvers, or how good defaults are selected). But I was sure we had problems with non-determinism on some platforms... but now can't find them.
> my students have basically no way to figure out what features the coefficients in their linear model correspond to, that seems a bit more important to me.
Yes, I agree... Assuming coefficients are helpful, rather than using permutation-based measures of importance, for instance.

You would apply the permutation-based feature importances before any preprocessing? I guess there's a case to be made for either option.
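
To make the two options concrete, here is a minimal sketch on synthetic data. It assumes a scikit-learn release that ships sklearn.inspection.permutation_importance, which is newer than this thread; the data, the pipeline, and the variable names are made up for illustration.

```python
import numpy as np
from sklearn.inspection import permutation_importance
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic data (illustrative only): just column 0 carries signal.
rng = np.random.RandomState(0)
X = rng.normal(size=(200, 3))
y = (X[:, 0] > 0).astype(int)

pipe = make_pipeline(StandardScaler(), LogisticRegression()).fit(X, y)

# Option 1: permute the raw input columns, i.e. before any preprocessing.
raw = permutation_importance(pipe, X, y, n_repeats=10, random_state=0)
raw_importances = raw.importances_mean

# Option 2: permute the transformed columns that the final estimator sees.
Xt = pipe[:-1].transform(X)
post = permutation_importance(pipe[-1], Xt, y, n_repeats=10, random_state=0)
post_importances = post.importances_mean
```

With a scaler only, both options rank the same columns. They start to differ once preprocessing changes the column layout, for instance with one-hot encoding.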

I think there are good reasons to look at coefficients though.

I generally think a review of distances might be a good thing at some point, given the confusing triplication across sklearn.neighbors, sklearn.metrics.pairwise, scipy.spatial... and that minkowski, p=2 is not implemented the same as euclidean.
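
A small sketch of the overlap, on made-up random points. It only checks that the three routes agree numerically on this input; which code path each call takes internally depends on the installed versions and is not something the snippet demonstrates.

```python
import numpy as np
from scipy.spatial.distance import cdist
from sklearn.metrics import pairwise_distances

# Made-up points, for illustration only.
rng = np.random.RandomState(0)
X = rng.rand(5, 3)

# Three ways to ask for the same pairwise Euclidean distances.
d_euclidean = pairwise_distances(X, metric="euclidean")
d_minkowski = pairwise_distances(X, metric="minkowski", p=2)
d_scipy = cdist(X, X, metric="euclidean")

# Largest disagreement between the two scikit-learn calls.
max_diff = float(np.abs(d_euclidean - d_minkowski).max())
```

Any nonzero max_diff here is floating-point noise, but the fact that equivalent metrics can go through separate implementations is the kind of duplication a review would look at.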

Yes, I agree. I guess right now I'm more enthusiastic about new features/APIs than decreasing technical debt, maybe because you're the one dealing with the technical debt ;)

_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn
