[scikit-learn] Possible bug in BayesianGaussianMixture?

2019-02-06 Thread Stefan Ulbrich via scikit-learn
Hello, I think I might have found a bug in BayesianGaussianMixture, or at least encountered behavior that I was not expecting. The problem occurs with clusters of small extent (in my case, 2D geographic data) that are far away from each other. While the means and their number a

Re: [scikit-learn] AUCROC/MAP confidence intervals in scikit

2019-02-06 Thread Sebastian Raschka
Hi Stuart, I don't think so, because there is no standard way to compute CIs. That goes for all performance measures (accuracy, precision, recall, etc.). Some people use simple binomial approximation intervals, some people prefer bootstrapping, etc. And it also depends on the data you have. In l
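
To illustrate the bootstrapping option mentioned in this reply, here is a minimal sketch of a percentile bootstrap interval for AUROC. It is not a scikit-learn function: the helper names `auroc` and `bootstrap_ci`, the defaults (`n_boot=1000`, `alpha=0.05`), and the toy data are assumptions made for illustration. The pure-Python `auroc` stands in for `sklearn.metrics.roc_auc_score`, which could be passed as `metric` instead.

```python
import random


def auroc(y_true, y_score):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_ci(y_true, y_score, metric=auroc, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval (hypothetical helper, not a library API)."""
    if len(set(y_true)) < 2:
        raise ValueError("y_true must contain both classes")
    rng = random.Random(seed)
    n = len(y_true)
    stats = []
    while len(stats) < n_boot:
        # Resample indices with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        yt = [y_true[i] for i in idx]
        if len(set(yt)) < 2:
            continue  # resample has a single class; the metric is undefined
        stats.append(metric(yt, [y_score[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


# Toy data, made up for this example.
y_true = [0, 0, 1, 1, 0, 1, 0, 1]
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.7, 0.5, 0.9]
print(auroc(y_true, y_score), bootstrap_ci(y_true, y_score))
```

The percentile method is only one of several bootstrap variants, and, as the reply notes, which interval is appropriate depends on the data.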

[scikit-learn] AUCROC/MAP confidence intervals in scikit

2019-02-06 Thread Stuart Reynolds
https://papers.nips.cc/paper/2645-confidence-intervals-for-the-area-under-the-roc-curve.pdf Does scikit (or other Python libraries) provide functions to measure the confidence interval of AUROC scores? Same question also for mean average precision. It seems like this should be a standard results r