Re: [R] partial residuals & the output of residuals.lm(..., type="partial")

Daniel McGlinn Wed, 18 Feb 2009 18:58:40 -0800

Dear list,

After thinking about it a little more I solved my question of why I wascalculating different residuals when usingresiduals.lm(...,type="partial") and when dropping a single term andrecalculating the residuals. This is because the two variables are in asense competing with one another in the full model if they are notcompletely orthogonal to one another. For example, with my hypotheticalexample from before if you make x1 and x2 and more correlated then thediscrepancy between the two sets of residuals increases, but the problemcan be solved if you make sure to use the same coefficients from thefull model when computing the raw residuals without the other variable.

Dan

Here is the example that shows that more correlated x-variables make theproblem even worse and a solution to my original question.


set.seed(12)
x1<-runif(100)

x2<-x1+runif(100) ##this will make x1 and x2 more strongly correlatedthan in the first example (see original message)

y<-.13+.25*x1+.70*x2+runif(100)
mod<-lm(y~x1+x2)
plot(residuals(mod,type="partial")[,2],residuals(update(mod,.~.-x2),type='response'))
abline(0,1)
##note how the degree of scatter increases around the 1:1 line
##here is the solution to the problem

##calculate the residuals by hand and make sure to use the estimatedcoefficients from the full modelcalc.resids<-y-cbind(rep(1,100),x1)%*%coef(mod)[-3] ##as before I willdrop the influence of x2 from the model prediction

##center the calculated residuals
calc.resids<- calc.resids-mean(calc.resids)

plot(residuals(mod,type="partial")[,2],calc.resids)
abline(0,1)##now all the points fall right on the line


-------- Original Message --------
Subject: partial residuals & the output of residuals.lm(...,type="partial")
From: Daniel McGlinn <daniel.mcgl...@okstate.edu>
To: r-help@r-project.org <r-help@r-project.org>
Date: 2/18/2009 7:52 PM

Dear list,
I would like to know how the function residuals.lm calculates thepartial residuals from an lm object with more than one predictorvariable. In other words what is residuals.lm(...,type="partial") doingbehind the scenes? According to the help file for residuals.lm(?residuals.lm), "The partial residuals are a matrix with each columnformed by omitting a term from the model". Unfortunately, I cannot seemto recreate the results of the function "residuals.lm" by simplydropping a variable from a model and then calculating the raw residualsof the updated model. Can anyone see what I am overlooking? It may behelpful to others if I mention that the usage ofresiduals.lm(...,type='partial') by the function termplot is whatmotivated me to look at this function more closely. Below is a simpleexample to illustrate my question:
set.seed(12)

x1 <- runif(100)
x2 <- runif(100)

y <- .13+.25*x1+.70*x2+runif(100)

mod <- lm(y~x1+x2)
##let's only consider the partial residuals when x2 is dropped from themodel
plot(residuals(mod,type="partial")[,2],residuals(update(mod,.~.-x2),type='response'))
abline(0,1) ##1:1 line
##why do the points not all fall on the 1:1 line?

Thanks,
Dan


--
Daniel J. McGlinn
Department of Botany, Oklahoma State University
117 LSE Stillwater OK 74078 USA 405-612-1780
http://ecology.okstate.edu/Libra/

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Re: [R] partial residuals & the output of residuals.lm(..., type="partial")

Reply via email to