On 07/25/2014 10:30 PM, Pagliari, Roberto wrote:

Hi Andy,

Maybe it's just me, but the "left out data" wording threw me off. Perhaps I would integrate it with your previous comments:

best_estimator_ : estimator
    Estimator that was chosen by the search, i.e. estimator which gave highest *average* score (or smallest loss if specified) *over the cross-validation folds*. on the left out data.

best_score_ : float
    *Highest average score* of *the* best_estimator *computed above* on the left out data.

This is not entirely correct. The "best_estimator_" is retrained on the whole training set, while best_score_ is the average over folds. I like your string for best_estimator_, but for best_score_ I would probably also say "Highest average score of the best parameter setting over cross-validation folds".
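
To make the distinction concrete, here is a minimal hand-rolled sketch of what a cross-validated parameter search does. It is not scikit-learn's implementation: the toy "estimator" (a shrunken training mean), the helper names fit, score and grid_search, and the data are all hypothetical and only serve to show that the best score is an average over folds, while the best estimator is refit afterwards on all of the data.

```python
from statistics import mean


def fit(y_train, shrink):
    # Hypothetical toy "estimator": a constant prediction equal to the
    # training mean multiplied by the parameter `shrink`.
    return shrink * mean(y_train)


def score(model, y_test):
    # Negative mean squared error, so that higher is better.
    return -mean((y - model) ** 2 for y in y_test)


def grid_search(y, shrinks, n_folds=3):
    # Split the data into n_folds interleaved folds.
    folds = [y[i::n_folds] for i in range(n_folds)]
    mean_scores = {}
    for shrink in shrinks:
        fold_scores = []
        for i, held_out in enumerate(folds):
            # Train on every fold except the held-out one.
            train = [v for j, f in enumerate(folds) if j != i for v in f]
            fold_scores.append(score(fit(train, shrink), held_out))
        # Average score of this parameter setting over the folds.
        mean_scores[shrink] = mean(fold_scores)
    best_param = max(mean_scores, key=mean_scores.get)
    best_score = mean_scores[best_param]   # average over folds
    best_estimator = fit(y, best_param)    # refit on ALL the data
    return best_param, best_score, best_estimator


y = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
best_param, best_score, best_estimator = grid_search(y, [0.0, 0.5, 1.0])
print(best_param, best_score, best_estimator)
```

Note that best_score here was never computed with the refit model: it belongs to the parameter setting, which is why "of the best parameter setting over cross-validation folds" describes it more accurately than "of the best_estimator".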

Pull request welcome. The current docstring warrants improvement I think ;)
