Re: [scikit-learn] SVC data normalisation

2017-05-08 Thread Brown J.B.
Dear Mamun,

> A. 80% features are binary [ 0 or 1 ]
> B. 10% are integer values representing counts / occurrences.
> C. 10% are continuous values between different ranges.
>
> My prior understanding was that decision tree based algorithms work better
> on mixed data types. In this particular c

[scikit-learn] SVC data normalisation

2017-05-08 Thread Mamun Rashid
Hi All,

I am testing two classifiers [ 1. Random forest 2. SVC with radial basis kernel ] on a data set via 5-fold cross-validation.

The feature matrix contains:
A. 80% of the features are binary [ 0 or 1 ]
B. 10% are integer values representing counts / occurrences.
C. 10% are continuous values bet
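
For readers who want to try the comparison described in this thread, here is a minimal sketch. It is not the poster's code. It assumes scikit-learn 0.20 or later (for ColumnTransformer), and the data, the column indices in scale_cols, and the hyperparameters are all made up for illustration. It standardises only the count and continuous columns, passes the binary columns through unchanged, and runs both classifiers under the same 5-fold split.

```python
import numpy as np
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)
n = 200

# Synthetic stand-in for the feature matrix (made-up data, not from the thread):
# 8 binary, 1 count and 1 continuous column, i.e. roughly an 80/10/10 split.
X_bin = rng.integers(0, 2, size=(n, 8)).astype(float)
X_cnt = rng.poisson(5.0, size=(n, 1)).astype(float)
X_con = rng.normal(100.0, 25.0, size=(n, 1))
X = np.hstack([X_bin, X_cnt, X_con])

# Hypothetical label that depends on one binary and the continuous feature.
y = ((X_bin[:, 0] + (X_con[:, 0] > 100.0)) >= 1).astype(int)

# Standardise only the count and continuous columns (indices are assumptions
# tied to the synthetic layout above); binary columns pass through untouched.
scale_cols = [8, 9]
preprocess = ColumnTransformer(
    [("scale", StandardScaler(), scale_cols)],
    remainder="passthrough",
)

models = {
    # Tree ensembles are insensitive to monotonic rescaling, so no scaler here.
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
    # The scaler sits inside the pipeline, so it is re-fit on each training fold.
    "svc_rbf": make_pipeline(preprocess, SVC(kernel="rbf", gamma="scale")),
}

# Same 5-fold split for both models so the scores are comparable.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = {name: cross_val_score(model, X, y, cv=cv) for name, model in models.items()}

for name, fold_scores in scores.items():
    print(f"{name}: mean accuracy {fold_scores.mean():.3f} over {len(fold_scores)} folds")
```

Keeping the scaler inside the pipeline means its mean and variance are estimated from the training portion of each fold only, which avoids leaking test-fold statistics into the SVC. Whether scaling changes the ranking of the two classifiers on the real data is something only the real data can show.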