Re: [scikit-learn] Should we standardize data before PCA?

2018-05-27 Thread James Melenkevitz via scikit-learn
And this is the SciKit Learn page on the normalizing:  http://scikit-learn.org/stable/auto_examples/preprocessing/plot_scaling_importance.html On Saturday, May 26, 2018, 10:10:32 PM PDT, Shiheng Duan wrote: Thanks.  Do you mean that if feature one has a larger derivation than feature t

Re: [scikit-learn] Should we standardize data before PCA?

2018-05-27 Thread James Melenkevitz via scikit-learn
And this you have likely seen already in Wikipedia:https://en.wikipedia.org/wiki/Principal_component_analysis"...PCA is mostly used as a tool in exploratory data analysis and for making predictive models. It's often used to visualize genetic distance and relatedness between populations. PCA ca

Re: [scikit-learn] Should we standardize data before PCA?

2018-05-27 Thread James Melenkevitz via scikit-learn
Here are more reference involving the "score" that may help you: https://stats.stackexchange.com/questions/222/what-are-principal-component-scores https://stats.stackexchange.com/questions/202578/what-is-the-meaning-of-the-variable-scores-in-matlabs-pca ftp://statgen.ncsu.edu/pub/thorne/molevoclas

Re: [scikit-learn] Should we standardize data before PCA?

2018-05-24 Thread James Melenkevitz via scikit-learn
https://stats.stackexchange.com/questions/69157/why-do-we-need-to-normalize-data-before-principal-component-analysis-pca On Thursday, May 24, 2018, 4:41:07 PM PDT, Shiheng Duan wrote: Hello all, I wonder is it necessary or correct to do z score transformation before PCA? I didn't see

Re: [scikit-learn] (no subject)

2018-05-24 Thread James Melenkevitz via scikit-learn
I did some more tests.  My issue that I brought up may be related to the custom kernel.  On Thursday, May 24, 2018, 12:49:34 PM PDT, Gael Varoquaux wrote: On Thu, May 24, 2018 at 09:35:00PM +0530, aijaz qazi wrote: > scikit- multi learn is misleading. Yes, but I am not sure what sci

[scikit-learn] Support Vector Regression

2018-05-24 Thread James Melenkevitz via scikit-learn
I have an SVR model that uses custom kernel as follows: 1) sgk = dual_laplace_gaussian_swarm(ss) svr_cust_sig = SVR(kernel=sgk, C=C_Value, epsilon = epsilon_value) svr_fit = svr_cust_sig.fit(X, y) #X is an array shape is [93, 24]  where each row is a time in the columns are variables for the mode