Re: [Scikit-learn-general] Planning on implementing sample_weight option for PLSRegression.fit()

2015-07-06 Thread Gael Varoquaux
On Mon, Jul 06, 2015 at 04:55:07PM -0400, Deepak Subburam wrote: > I have some time to contribute, and would like to implement a > sample_weight option to the fit() method of > sklearn.cross_decomposition.PLSRegression, handling any other knock-on > effects. Let me know if you think this is not

Re: [Scikit-learn-general] Number of subsamples in Random Forest

2015-07-06 Thread Sebastian Raschka
Thanks, Jeff, that was what I expected, but I just wanted to make sure. I will add a note via a pull request later. > On Jul 6, 2015, at 3:52 PM, Jeff Blackburne wrote: > > Hi Sebastian, > > The bootstrap sample size is always the same as the input sample size. If you > feel up to it, a pull

[Scikit-learn-general] Planning on implementing sample_weight option for PLSRegression.fit()

2015-07-06 Thread Deepak Subburam
Hi, I have some time to contribute, and would like to implement a sample_weight option to the fit() method of sklearn.cross_decomposition.PLSRegression, handling any other knock-on effects. Let me know if you think this is not a good idea, or have any other thoughts to guide me. Thanks, Deepa

Re: [Scikit-learn-general] Another RandomForest question: Max_features "auto" vs "sqrt"

2015-07-06 Thread Jeff Blackburne
That is the intended behavior. If you look at RandomForestRegressor, "auto" means something different. -Jeff On Mon, Jul 6, 2015 at 12:35 PM, Sebastian Raschka wrote: > Hi, > another RandomForest-related question, in the documentation it reads like > the “auto” and “sqrt” are the same for sele
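
For readers of this thread, here is a minimal sketch of the difference Jeff describes, assuming the behavior documented for scikit-learn at the time of this thread (2015): for classifiers "auto" resolved to the square root of the number of features, while for regressors "auto" resolved to all features. The helper `resolve_max_features` and its `is_classifier` flag are hypothetical names used only for illustration; they are not part of the scikit-learn API.

```python
import math


def resolve_max_features(value, n_features, is_classifier):
    """Hypothetical helper: map a max_features string to a feature count.

    Mirrors the documented 2015-era behavior only; not scikit-learn code.
    """
    if value == "sqrt":
        # "sqrt" means the same thing for classifiers and regressors.
        return max(1, int(math.sqrt(n_features)))
    if value == "auto":
        if is_classifier:
            # Classifier: "auto" behaves like "sqrt".
            return max(1, int(math.sqrt(n_features)))
        # Regressor: "auto" considers every feature at each split.
        return n_features
    raise ValueError("unsupported max_features value: %r" % (value,))


clf_auto = resolve_max_features("auto", 100, is_classifier=True)
reg_auto = resolve_max_features("auto", 100, is_classifier=False)
reg_sqrt = resolve_max_features("sqrt", 100, is_classifier=False)
print(clf_auto, reg_auto, reg_sqrt)
```

So "auto" and "sqrt" coincide only for the classifier, which is why the classifier documentation reads as if the two options were the same.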

Re: [Scikit-learn-general] Number of subsamples in Random Forest

2015-07-06 Thread Jeff Blackburne
Hi Sebastian, The bootstrap sample size is always the same as the input sample size. If you feel up to it, a pull request updating the documentation would probably be quite welcome. -Jeff On Mon, Jul 6, 2015 at 12:21 PM, Sebastian Raschka wrote: > Hi, all, > > I am wondering what the sample s
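
The statement above (a bootstrap sample has as many rows as the input) can be illustrated with a small sketch using only the standard library. This is not the scikit-learn implementation; `bootstrap_indices` is a hypothetical helper, and the seed and sample count are arbitrary choices.

```python
import random


def bootstrap_indices(n_samples, seed=0):
    """Hypothetical helper: draw n_samples row indices with replacement."""
    rng = random.Random(seed)
    # One draw per input row, so the bootstrap sample is as large as the input.
    return [rng.randrange(n_samples) for _ in range(n_samples)]


n_samples = 1000
idx = bootstrap_indices(n_samples)
n_unique = len(set(idx))

# The sample has n_samples entries, but because sampling is with
# replacement, some rows repeat and others are left out entirely.
print(len(idx), n_unique)
```

Because the draws are with replacement, the number of distinct rows in each bootstrap sample is smaller than the input size even though the sample itself has the same length.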

[Scikit-learn-general] Another RandomForest question: Max_features "auto" vs "sqrt"

2015-07-06 Thread Sebastian Raschka
Hi, another RandomForest-related question: the documentation reads as if “auto” and “sqrt” are the same for selecting max_features. Is this the intended behavior? http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html • If “auto”, then ma

[Scikit-learn-general] Number of subsamples in Random Forest

2015-07-06 Thread Sebastian Raschka
Hi, all, I am wondering what the sample size of the bootstrap samples is in the RandomForestClassifier/Regressor (if bootstrap=True, the default setting). From the documentation http://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html#sklearn.ensemble.RandomFo

Re: [Scikit-learn-general] Scikit-learn with gcc 4.2 on FreeBSD.

2015-07-06 Thread Andy
Hi Nastaran. This looks like a scipy problem. I guess it depends on which BLAS you link against. I'm not sure if it runs on BSD, but you could try running Anaconda http://continuum.io/downloads Otherwise maybe check on the scipy list. Andy On 07/02/2015 10:04 PM, Nastaran Baradaran wrote: Hi,