Re: [Scikit-learn-general] Simulated annealing for parameter search?

2014-06-23 Thread Joel Nothman
On 23 June 2014 23:31, Sturla Molden wrote: > Joel Nothman wrote: > > A number of generic parameter search functions are available in > > scipy.optimize, including simulated annealing. > > SA is deprecated in scipy.optimize. > Oh!

Re: [Scikit-learn-general] Simulated annealing for parameter search?

2014-06-23 Thread Sturla Molden
Joel Nothman wrote: > A number of generic parameter search functions are available in > scipy.optimize, including simulated annealing. SA is deprecated in scipy.optimize. -- Open source business process management suit

Re: [Scikit-learn-general] Simulated annealing for parameter search?

2014-06-23 Thread Mathieu Blondel
> A number of generic parameter search functions are available in > scipy.optimize, including simulated annealing. To wrap them in a > scikit-learn interface is fairly trivial. If you are talking about model > selection using simulated annealing, I once wrote a GridSearchCV-like > extension that co

Re: [Scikit-learn-general] Simulated annealing for parameter search?

2014-06-23 Thread Olivier Grisel
It's implemented in hyperopt master: https://github.com/hyperopt/hyperopt/blob/master/hyperopt/anneal.py and it has sklearn integration here: https://github.com/hyperopt/hyperopt-sklearn -- Olivier -- Open source busi

Re: [Scikit-learn-general] Simulated annealing for parameter search?

2014-06-23 Thread Joel Nothman
A number of generic parameter search functions are available in scipy.optimize, including simulated annealing. To wrap them in a scikit-learn interface is fairly trivial. If you are talking about model selection using simulated annealing, I once wrote a GridSearchCV-like extension that could use an

[Scikit-learn-general] Simulated annealing for parameter search?

2014-06-23 Thread Michal Romaniuk
Has anyone worked on simulated annealing or similar algorithms for parameter search? Michal -- Open source business process management suite built on Java and Eclipse Turn processes into business applications with Bonita

Re: [Scikit-learn-general] Scikit learn's multiprocessing

2014-06-23 Thread Greg Dhuse
Sturla Molden writes: > > Rich Lewis wrote: > > > I have been running RandomForestRegressor models for a while now on my > > MacBook using the n_jobs=-1 option, which has worked well in the past. > > The cue here might be "MacBook"... > > Which LAPACK is NumPy and SciPy linked against? > >

Re: [Scikit-learn-general] Implementation of new Anomaly/Outlier Detection Algorithms

2014-06-23 Thread Nicolas Goix
Hello, The following study evaluates on the DARPA 1998 data set four outlier detection algorithms : Unserpervised SVM, LOF approach, NN approach and Mahalanobis-based approach : http://static.msi.umn.edu/rreports/2003/72.pdf They find the LOF approach to be the more efficient, followed by the

Re: [Scikit-learn-general] Scikit-learn-general Digest, Vol 53, Issue 36

2014-06-23 Thread Michael Eickenberg
On Mon, Jun 23, 2014 at 4:45 PM, Luca Puggini wrote: > > Hi, > I know that LARS is usually faster. > On the other side CD is often considered more robust. In particular in > situation p>>n the Lars is not able to include in the model more than n > variables. > Which doesn't mean that this is not

Re: [Scikit-learn-general] OneVsRestClassifier : No attribute predict_proba

2014-06-23 Thread Joel Nothman
Yes, that could happen too, perhaps. On 23 June 2014 10:56, abhishek wrote: > you mean the column sum is 0? > > > On Mon, Jun 23, 2014 at 4:48 PM, Joel Nothman > wrote: > >> In training, it seems one of those columns is constant. >> >> >> On 23 June 2014 10:40, abhishek wrote: >> >>> It is a

Re: [Scikit-learn-general] OneVsRestClassifier : No attribute predict_proba

2014-06-23 Thread abhishek
you mean the column sum is 0? On Mon, Jun 23, 2014 at 4:48 PM, Joel Nothman wrote: > In training, it seems one of those columns is constant. > > > On 23 June 2014 10:40, abhishek wrote: > >> It is a multilabel problem. So the labels look like the following: >> >> [1 0 0] >> [0 1 0] >> [0 1 1]

Re: [Scikit-learn-general] OneVsRestClassifier : No attribute predict_proba

2014-06-23 Thread Joel Nothman
https://github.com/scikit-learn/scikit-learn/pull/3308 is a fix for your error, however I think the data problem is something you should be aware of. On 23 June 2014 10:48, Joel Nothman wrote: > In training, it seems one of those columns is constant. > > > On 23 June 2014 10:40, abhishek wrote

Re: [Scikit-learn-general] OneVsRestClassifier : No attribute predict_proba

2014-06-23 Thread Joel Nothman
In training, it seems one of those columns is constant. On 23 June 2014 10:40, abhishek wrote: > It is a multilabel problem. So the labels look like the following: > > [1 0 0] > [0 1 0] > [0 1 1] > > and so on.. > > > On Mon, Jun 23, 2014 at 4:31 PM, Joel Nothman > wrote: > >> Not that this er

Re: [Scikit-learn-general] Scikit-learn-general Digest, Vol 53, Issue 36

2014-06-23 Thread Luca Puggini
Hi, I know that LARS is usually faster. On the other side CD is often considered more robust. In particular in situation p>>n the Lars is not able to include in the model more than n variables. I think that the best think to do would be to include the possibility to choice which algorithm to use a

Re: [Scikit-learn-general] OneVsRestClassifier : No attribute predict_proba

2014-06-23 Thread abhishek
It is a multilabel problem. So the labels look like the following: [1 0 0] [0 1 0] [0 1 1] and so on.. On Mon, Jun 23, 2014 at 4:31 PM, Joel Nothman wrote: > Not that this error is correct behaviour, but that you might not be aware > that there is a likely problem with your data. > > > On 23

Re: [Scikit-learn-general] OneVsRestClassifier : No attribute predict_proba

2014-06-23 Thread Joel Nothman
Not that this error is correct behaviour, but that you might not be aware that there is a likely problem with your data. On 23 June 2014 10:30, Joel Nothman wrote: > It seems that there is a class label present in all training instances... > > > On 23 June 2014 10:20, abhishek wrote: > >> Hi a

Re: [Scikit-learn-general] OneVsRestClassifier : No attribute predict_proba

2014-06-23 Thread Joel Nothman
It seems that there is a class label present in all training instances... On 23 June 2014 10:20, abhishek wrote: > Hi all, > > Ive been getting this very weird error when using OneVsRestClassifier. > > > ---AttributeError >

Re: [Scikit-learn-general] Randomixed L1 regression and coordinate descent algorithm.

2014-06-23 Thread Alexandre Gramfort
hi, > I was wondering if there is any reason of why the randomized l1 algorithm > from the stability selection paper is implemented only using Lars Lasso and > not the coordinate descent algorithm. > I think than including a version of the algorithm with the coordinate > descent method would be ve

[Scikit-learn-general] OneVsRestClassifier : No attribute predict_proba

2014-06-23 Thread abhishek
Hi all, Ive been getting this very weird error when using OneVsRestClassifier. ---AttributeError Traceback (most recent call last) in ()> 1 preds_sgd = sgd.predict_proba(xtest) /usr/local/Cella

[Scikit-learn-general] Randomixed L1 regression and coordinate descent algorithm.

2014-06-23 Thread Luca Puggini
Hi, I was wondering if there is any reason of why the randomized l1 algorithm from the stability selection paper is implemented only using Lars Lasso and not the coordinate descent algorithm. I think than including a version of the algorithm with the coordinate descent method would be very useful.

[Scikit-learn-general] multiprocessing/forking.py freeze_support bug in Anaconda distro?

2014-06-23 Thread László Sándor
Hi, I was following your examples for some simple applications, but ran into a bug with CV and I am not sure where I should modify my script (does it have a "main module"?) or patch installed packages. Is this something scikit might need to fix? See more on StackOverflow: http://stackoverflow.com/

Re: [Scikit-learn-general] Getting decision tree regressor to predict using median not mean, of final subset

2014-06-23 Thread Joel Nothman
I think that should be Tree.apply, not apply_Tree. I.e. I guess you want to use something like (unverified): for leaf_ind, values in groupby(sorted(zip(regressor.tree_.apply(X_train), y_train)), operator.itemgetter(0)): regressor.tree_.values[leaf_ind, ...] = np.median(list(values)) On 23 Ju

Re: [Scikit-learn-general] Getting decision tree regressor to predict using median not mean, of final subset

2014-06-23 Thread Peter Prettenhofer
Hi James, if you look at the LAD loss function in the gradient_boosting module you can find an example how to do it. Basically, you need to update the values array in the Tree extension type. Tree.apply_Tree(x_train) gives you the training instances in each leaf. HTH, Peter Am 23.06.2014 13:48 sc

[Scikit-learn-general] Getting decision tree regressor to predict using median not mean, of final subset

2014-06-23 Thread James McMurray
Hi, I want to use the decision tree regressor to predict using the median of the resulting subset from the tree, rather than the mean? Is there a simple way to do this? I looked at the code, but in sklearn/tree/tree.py, the only relevant line is: proba = self.tree_.predict(X) Where the