Re: [scikit-learn] Need Urgent help please in resolving JobLibMemoryError

2016-12-09 Thread Piotr Bialecki
Thanks in advance ! Cheers, Debu On Fri, Dec 9, 2016 at 2:48 PM, Piotr Bialecki mailto:piotr.biale...@hotmail.de>> wrote: Hi Debu, it seems that you run out of memory. Try using fewer processes. I don't think that n_jobs = 1000 will perform as you wish. Setting n_jobs to -1 uses

Re: [scikit-learn] Need Urgent help please in resolving JobLibMemoryError

2016-12-09 Thread Piotr Bialecki
Hi Debu, it seems that you run out of memory. Try using fewer processes. I don't think that n_jobs = 1000 will perform as you wish. Setting n_jobs to -1 uses the number of cores in your system. Greets, Piotr On 09.12.2016 08:16, Debabrata Ghosh wrote: Hi All, Greetings !

Re: [scikit-learn] NuSVC and ValueError: specified nu is infeasible

2016-12-08 Thread Piotr Bialecki
entirely 1000% sure of this anymore, but I think it still holds. Michael On Thu, Dec 8, 2016 at 11:08 AM, Piotr Bialecki mailto:piotr.biale...@hotmail.de>> wrote: Hi Michael, hi Thomas, I think the nu value is bound to (0, 1]. So the code will result in a ValueError (at least in sklear

Re: [scikit-learn] NuSVC and ValueError: specified nu is infeasible

2016-12-08 Thread Piotr Bialecki
s": clf = svm.NuSVC(probability=True) clf.fit(train_list_resampled3, train_activity_list_resampled3, sample_weight=None) then no, it does not converge. After all "sample_weight=None" is the default value. I am out of ideas about what may be the problem. Thomas

Re: [scikit-learn] no positive predictions by neural_network.MLPClassifier

2016-12-08 Thread Piotr Bialecki
Hi Thomas, Hi Thomas, besides that information of Sebastian, you dataset seems to be quite imbalances (48 positive and 1230 negative observations). You could try rebalancing your data using https://github.com/scikit-learn-contrib/imbalanced-learn This package offers some methods for resampling

Re: [scikit-learn] NuSVC and ValueError: specified nu is infeasible

2016-12-07 Thread Piotr Bialecki
Hi Thomas, the doc says, that nu gives an upper bound on the fraction of training errors and a lower bound of the fractions of support vectors. http://scikit-learn.org/stable/modules/generated/sklearn.svm.NuSVC.html Therefore, it acts as a hard bound on the allowed misclassification on your dat

Re: [scikit-learn] Calculating prediction probability per each predicted outcome

2016-10-26 Thread Piotr Bialecki
Hi Suranga, if you are using the MLPClassifier class, it should have a predict_proba() method. Try: predicted = neural_network.predict_proba(test_data) Best regards, Piotr On 26.10.2016 17:26, Suranga Kasthurirathne wrote: Hi everyone, I'm currently using Scikit learn to train and test multi

Re: [scikit-learn] tree visualization with class names in leaves

2016-10-25 Thread Piotr Bialecki
Hi Sanant, the values represent the thresholds at the current feature (node), which are used to classify the next sample. You can see an example here: http://scikit-learn.org/stable/modules/tree.html The first node uses the feature "petal length (cm)" with a threshold of 2.45. If your future s

Re: [scikit-learn] HashingVectorizer slow in version 0.18

2016-10-11 Thread Piotr Bialecki
I just tested it on my Ubuntu machine and could not see any performance issues (5.68 seconds in scikit-learn 0.17 vs. 6.67 seconds in scikit-learn 0.18) However, on another Windows 10 machine I could indeed see this issue: scikit-learn 0.17.1. Numpy 1.11.1. Python 2.7.12 AMD64 Vectorizing 20new

Re: [scikit-learn] ANN Scikit-learn 0.18 released

2016-10-11 Thread Piotr Bialecki
kowski.pl<mailto:mac...@wojcikowski.pl> 2016-10-11 14:32 GMT+02:00 Piotr Bialecki mailto:piotr.biale...@hotmail.de>>: Congratulations to all contributors! I would like to update to the new version using conda, but apparently it is not available: ~$ conda update scikit-learn Fetching

Re: [scikit-learn] ANN Scikit-learn 0.18 released

2016-10-11 Thread Piotr Bialecki
Congratulations to all contributors! I would like to update to the new version using conda, but apparently it is not available: ~$ conda update scikit-learn Fetching package metadata ... Solving package specifications: .. # All requested packages already installed. # packages in env

Re: [scikit-learn] Gradient Boosting: Feature Importances do not sum to 1

2016-09-09 Thread Piotr Bialecki
Hi Doug, I modified your code a little bit to calculate the feature_importances of every tree of the forest. In my opinion these feature importances should also sum to 1.0. Since I could not access each DecisionTreeRegressor of your GradientBoositngRegressor, I created a new ExtraTreeRegressor.

Re: [scikit-learn] Tuning custom parameters using grid_search

2016-09-07 Thread Piotr Bialecki
customestimator__my_param': [3], > 'logisticregression__C': [0.1, 1.0, 10.0]} > > gsearch1 = GridSearchCV(estimator=pipe, param_grid=grid) > > gsearch1.fit(X, y) > > > Then, you can put in your desired preprocessing stuff into fit and tra

[scikit-learn] Tuning custom parameters using grid_search

2016-09-07 Thread Piotr Bialecki
Hi all, I am currently tuning some parameters of my xgboost model using scikit's grid_search, e.g.: param_test1 = {'max_depth':range(3,10,2), 'min_child_weight':range(1,6,2) } gsearch1 = GridSearchCV(estimator = XGBClassifier(learning_rate =0.1, n_estimators=762,