Re: [Scikit-learn-general] Scalability of Gradient Boosting Classifier

2015-10-01 Thread Gael Varoquaux
On Thu, Oct 01, 2015 at 11:10:51AM +0200, Maryam Tavakol wrote: > My problem however is the size of data in terms of number of samples. > The features are engineered and are only 80. I wanted to try training > on bigger set of data for improvement. I would use the BIRCH clustering method in an onl

Re: [Scikit-learn-general] [New feature] sklearn to PMML

2015-10-01 Thread Joel Nothman
Hi Mira, I think the community is very interested in this work, but you might consider collaborating with https://github.com/alex-pirozhenko/sklearn-pmml. Its support for models is limited to trees and their ensembles, but it also includes a test harness ( https://github.com/alex-pirozhenko/sklear

[Scikit-learn-general] [New feature] sklearn to PMML

2015-10-01 Thread Mira Epheldel
Hello, I've started working on a project that exports sklearn models to PMML format. Since I'm new to open source etc, I'm not sure if I should post to the mailing list or not about the kind of question I have, but anyway here I am. First of all, I'm not sure if this project it is interresting en

Re: [Scikit-learn-general] Scalability of Gradient Boosting Classifier

2015-10-01 Thread Maryam Tavakol
Hi Jacob, Thank you for clarification. My problem however is the size of data in terms of number of samples. The features are engineered and are only 80. I wanted to try training on bigger set of data for improvement. Thanks & Best, Maryam > > -- > > Message: 3 > Da