Re: [Scikit-learn-general] Sub sampling large datasets

2012-02-06 Thread Olivier Grisel
2012/2/6 Shishir Pandey : > Hi > > I am working with a dataset which too big to fit in the memory. Is there a > way in scikits-learn to sub sample the existing dataset maintaining its > properties so that I can load it in my RAM? We don't have any "smart" subsampler in scikit-learn (like a GMM cor

Re: [Scikit-learn-general] GridSearch

2012-02-06 Thread Andreas
On 02/03/2012 01:59 PM, Mathias Verbeke wrote: Hi Andreas, You would have to add it to the "fit" method of SVC, not GridSearchCV. How can this be done in the digits example, since there's only one fit there, namely the one of GridSearch? > Does this mean class weighting isn't possib

[Scikit-learn-general] Sub sampling large datasets

2012-02-06 Thread Shishir Pandey
Hi I am working with a dataset which too big to fit in the memory. Is there a way in scikits-learn to sub sample the existing dataset maintaining its properties so that I can load it in my RAM? with regards, --Shishir Pandey