Re: [Scikit-learn-general] Generalised warm start / parameter search

Andreas Mueller Mon, 20 May 2013 03:23:59 -0700

On 05/20/2013 05:20 AM, Joel Nothman wrote:

I couldn't help but work on it, it seems.
The Pipeline's refit is trivial given that all sub-estimators have a refit that will do nothing if certain parameters are not passed (and in case not all have set their noop_params, we can explicitly only refit from the first step where a parameter is changed).
By trivial, I mean just take the current fit(_transform) implementation and replace fit(_transform) with refit(_transform), passing each step its parameters.
Its _plan_refits is not as trivial. First, let us assume no steps are set as parameters <https://github.com/scikit-learn/scikit-learn/pull/1769>. Consider the parameters as a tree, like that attached, with each layer corresponding to a step and the parameters set there. (The tree attached corresponds to a grid, but it needn't be so balanced.) If you reorder the children of each node using the relevant _plan_refits (perhaps with memoization for the common grid case), and aggregate the costs appropriately, you should end up with a good plan.
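To make the ordering idea concrete, here is a rough sketch in plain Python. It is not the proposed _plan_refits; the step names, the grid and the count_step_fits helper are all made up for illustration. It assumes a step must be refit whenever its own parameter or any upstream parameter differs from the previous candidate, and simply counts how many step fits two different candidate orderings would cost.

```python
# Hypothetical two-step pipeline: step names in order, one parameter each.
STEPS = ["select", "clf"]
GRID = {"select": [10, 20], "clf": [0.1, 1.0, 10.0]}


def count_step_fits(candidates):
    """Count step fits, assuming a step is refit only if its own parameter
    or any upstream parameter differs from the previous candidate."""
    fits = 0
    previous = None
    for cand in candidates:
        changed = previous is None  # the first candidate fits every step
        for step in STEPS:
            changed = changed or cand[step] != previous[step]
            if changed:
                fits += 1
        previous = cand
    return fits


# Naive order: the first step's parameter changes on every candidate.
naive = [{"select": s, "clf": c} for c in GRID["clf"] for s in GRID["select"]]

# Tree-like order: parameters of earlier steps vary slowest.
planned = sorted(naive, key=lambda cand: [cand[step] for step in STEPS])

naive_fits = count_step_fits(naive)
planned_fits = count_step_fits(planned)
```

With this toy grid the naive order fits a step 12 times and the planned order 8 times. A real plan would also weigh per-step costs rather than just counting fits.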

I will have to think about this after the NIPS deadline ;)

Now, I don't know enough about regularisation paths. I am worried they don't fit naturally in this framework, because they need all candidate values for the relevant parameter to be specified at once; I was hoping someone would shout that at me when I proposed this. Could you please clarify?

Basically you have a highest and a lowest value of the regularization parameter, fit one model for the highest value, and can then efficiently produce models for all possible values in between.

The thing here is that you can efficiently compute the models for all values of the parameter if you compute them together.
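As an illustration, here is a small pure-Python sketch of one common way to compute such a path: coordinate descent for the lasso, warm-started from the solution at the previous (larger) regularization value. This is a stand-in technique chosen for brevity, not a claim about how any scikit-learn estimator does it; the function names and the toy data are invented for the example.

```python
def soft_threshold(value, threshold):
    """Shrink value towards zero by threshold (lasso proximal step)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def lasso_cd(X, y, alpha, w_init, tol=1e-10, max_sweeps=10000):
    """Coordinate descent for (1/(2n)) * ||y - Xw||^2 + alpha * ||w||_1,
    started from w_init. Returns the weights and the number of sweeps."""
    n, p = len(X), len(X[0])
    w = list(w_init)
    for sweep in range(1, max_sweeps + 1):
        max_change = 0.0
        for j in range(p):
            # Correlation of feature j with the residual that ignores w[j].
            rho = sum(
                X[i][j] * (y[i] - sum(X[i][k] * w[k] for k in range(p) if k != j))
                for i in range(n)
            ) / n
            z = sum(X[i][j] ** 2 for i in range(n)) / n
            new_wj = soft_threshold(rho, alpha) / z
            max_change = max(max_change, abs(new_wj - w[j]))
            w[j] = new_wj
        if max_change < tol:
            return w, sweep
    return w, max_sweeps


def lasso_path_warm(X, y, alphas):
    """Solve for all alphas together, from largest to smallest, reusing
    each solution as the starting point for the next alpha."""
    w = [0.0] * len(X[0])
    path = {}
    for alpha in sorted(alphas, reverse=True):
        w, _ = lasso_cd(X, y, alpha, w)
        path[alpha] = list(w)
    return path


# Toy data (made up): y is exactly X times [2, -1].
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]
y = [2.0, -1.0, 1.0, 3.0]
path = lasso_path_warm(X, y, [0.0, 0.75, 1.5])
```

The point is that lasso_path_warm needs the whole list of alphas up front, which is exactly what makes it awkward to drive one candidate at a time from a generic parameter search.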

This is basically the opposite of your "group by value" strategy.

If you are not so much into linear models, another way of thinking about it is with trees / forests: if you compute a tree up to a certain depth, you basically get the trees with smaller depths for free. Similar things are true for boosting.
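A toy sketch of the tree case, in plain Python and with invented names (build_tree, predict) and made-up data: a greedy 1-D regression tree is grown once to the maximum depth, and each node keeps the mean of its targets, so predictions for any smaller depth come from stopping the descent early. This only illustrates the idea; it is not how scikit-learn's trees are implemented.

```python
def sse(values):
    """Sum of squared errors around the mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def build_tree(xs, ys, max_depth):
    """Greedy 1-D regression tree. Every node stores the mean of its
    targets, so the tree can be read at any depth <= max_depth."""
    node = {"value": sum(ys) / len(ys), "split": None}
    cuts = sorted(set(xs))
    if max_depth == 0 or len(cuts) < 2:
        return node
    thresholds = [(a + b) / 2 for a, b in zip(cuts, cuts[1:])]

    def cost(t):
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        return sse(left) + sse(right)

    t = min(thresholds, key=cost)
    left = [(x, y) for x, y in zip(xs, ys) if x <= t]
    right = [(x, y) for x, y in zip(xs, ys) if x > t]
    node["split"] = t
    node["left"] = build_tree([x for x, _ in left], [y for _, y in left], max_depth - 1)
    node["right"] = build_tree([x for x, _ in right], [y for _, y in right], max_depth - 1)
    return node


def predict(node, x, depth):
    """Predict as if the tree had been grown only to the given depth."""
    while depth > 0 and node["split"] is not None:
        node = node["left"] if x <= node["split"] else node["right"]
        depth -= 1
    return node["value"]


# Made-up data; one fit at depth 3 serves depths 0, 1, 2 and 3.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [1.0, 1.2, 3.0, 3.1, 7.0, 7.2, 9.0, 9.5]
deep = build_tree(xs, ys, max_depth=3)
shallow_predictions = {d: [predict(deep, x, d) for x in xs] for d in range(4)}
```

Because the greedy split at a node does not depend on how much deeper the tree will grow, reading the deep tree at depth d gives the same predictions as fitting a fresh tree with max_depth=d.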

For these you basically need to know a maximum and / or minimum parameter setting and compute solutions for all parameter settings at once. I feel like this is the really interesting case, which we should try to solve.

Basically your proposal addresses cases where one doesn't need to touch parts of the pipeline at all.

It wouldn't help us get rid of any of the CV objects, though.

Is there something interesting about StandardScaler, or have you thrown it in for fun? Or for an example where transform is more expensive than fit?

Just for fun ;) Basically I thought that was one that you don't really need to refit at all (for a given fold) as you usually don't search over any parameters.


