On 2010-06-11 2:16, Sandra Hawthorne wrote:
Hi,

I'm trying to verify the calculation of coefficient of determination (r 
squared) for linear regression. I've done the calculation manually with a 
simple test case and using the definition of r squared outlined in summary(lm) 
help. There seems to be a discrepancy between the what R produced and the 
manual calculation. Does anyone know why this is so? What does the multiple r 
squared reported in summary(lm) represent?

# The test case:
x<- c(1,2,3,4)
y<- c(1.6,4.4,5.5,8.3)
dummy<- data.frame(x, y)
fm1<- lm(y ~ x-1, data = dummy)
summary(fm1)
betax<- fm1$coeff[x] * sd(x) / sd(y)
# cd is coefficient of determination
cd<- betax * cor(y, x)

The discrepancy is due to incorrect manual calculation.
You're using (incorrectly, at that) formulas for simple
regression _with an intercept term_ whereas you model
has _no_ intercept term.

What summary.lm reports is clearly described on the
help page. See r.squared in the Value section.

  -Peter Ehlers

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to