Re: [R] Alternative to Scale Function?

Noah Silverman Fri, 11 Sep 2009 13:48:24 -0700

I think I just answered my own question.

The scale function will return the mean and sd of the data.


So the process is fairly simple.
scale training data varaible
note mean and sd from the scale

then manually scale the test data using the mean and sd from thetraining data.

That should make sure that a value is transformed the same regardless ofwhich data set it is in.


Do I have this correct, or can anybody contribute any more to the concept?

Thanks!


--
Noah

On 9/11/09 1:10 PM, Noah Silverman wrote:

Hi,
Is there an alternative to the scale function where I can specify myown mean and standard deviation?
I've come across an interesting issue where this would help.
I'm training and testing on completely different sets of data. Thetesting set is smaller than the training set.
Using the standard scale function of R seems to introduce some error.Since it scales data WITHIN the set, it may scale the same number todifferent value since the range in the training and testing set may bedifferent.
My thought was to scale the larger training set of data, then use themean and SD of the training data to scale the testing data accordingto the same parameters. That way a number will transform to the sameresult regardless of whether it is in the training or testing set.
I can't be the first one to have looked at this. Does anyone know ofa function in R or if there is a scale alternative where I can controlthe parameters?
Thanks!

--
Noah

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guidehttp://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] Alternative to Scale Function?

Reply via email to