Re: [Scikit-learn-general] KFold cross validation strangely defaults to not shuffle

2014-04-21 Thread Skipper Seabold
On Thu, Apr 17, 2014 at 9:56 AM, Mathieu Blondel wrote: > It seems to me that you assume the order in which a dataset is laid out is > meaningful. I think there are cases when this order might be completely > artificial and not reflect the true distribution of the data. For me, the > order is an i

Re: [Scikit-learn-general] KFold cross validation strangely defaults to not shuffle

2014-04-21 Thread Cory Dolphin
Mathieu, in my case the order of the data is also an implementation detail. The default of not-shuffling produced an unfair split. Due to the fact that my data was sorted as a way or organizing the querying and packing of the data, models were trained and tested on data that were almost as disparat

[Scikit-learn-general] scikit-learn roc_curve returns strange threshold

2014-04-21 Thread Kuo Liu
Hello, I've posted my question on Stack Overflow: http://stackoverflow.com/questions/23200518/scikit-learn-roc-curve-why-does-it-return-a-threshold-value-2-some-time Thanks for the help! -- Start Your Social Network Today