It only makes sense train a tree on a subset as part of an ensembl method,
and in that case you can train a set of trees by training each one on a
subset of the data (be sure to randomly choose the subset though).
It's true that ensembl methods like RandomForest don't have partial_fit,
but you could still train the model on subsets and combine them.
Tommy
On Sat, Nov 17, 2012 at 11:19 AM, Ronnie Ghose <[email protected]>wrote:
> See you guys just said I could use trees on subsets and they will work
> well.
>
> So why not partial_fits + trees?
>
>
> On 17 November 2012 11:12, Gael Varoquaux
> <[email protected]>wrote:
>
>> On Sat, Nov 17, 2012 at 11:10:52AM -0500, Ronnie Ghose wrote:
>> > hmm i'm asking is it possible to run all of the typical ~ whatever that
>> > means ~ models in sklearn on a subset of that data and have it work
>> > pretty well most of the time?
>>
>> No, only those that have 'partial_fit'.
>>
>> G
>>
>>
>>
>> ------------------------------------------------------------------------------
>> Monitor your physical, virtual and cloud infrastructure from a single
>> web console. Get in-depth insight into apps, servers, databases, vmware,
>> SAP, cloud infrastructure, etc. Download 30-day Free Trial.
>> Pricing starts from $795 for 25 servers or applications!
>> http://p.sf.net/sfu/zoho_dev2dev_nov
>> _______________________________________________
>> Scikit-learn-general mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>>
>
>
>
> ------------------------------------------------------------------------------
> Monitor your physical, virtual and cloud infrastructure from a single
> web console. Get in-depth insight into apps, servers, databases, vmware,
> SAP, cloud infrastructure, etc. Download 30-day Free Trial.
> Pricing starts from $795 for 25 servers or applications!
> http://p.sf.net/sfu/zoho_dev2dev_nov
> _______________________________________________
> Scikit-learn-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>
>
------------------------------------------------------------------------------
Monitor your physical, virtual and cloud infrastructure from a single
web console. Get in-depth insight into apps, servers, databases, vmware,
SAP, cloud infrastructure, etc. Download 30-day Free Trial.
Pricing starts from $795 for 25 servers or applications!
http://p.sf.net/sfu/zoho_dev2dev_nov
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general