[R] Autocorrelation in linear models

Arni Magnusson Wed, 16 Mar 2011 18:09:09 -0700

I have been reading about autocorrelation in linear models over the lastcouple of days, and I have to say the more I read, the more confused Iget. Beyond confusion lies enlightenment, so I'm tempted to ask R-Help forguidance.

Most authors are mainly worried about autocorrelation in the residuals,but some authors are also worried about autocorrelation within Y andwithin X vectors before any model is fitted. Would you test forautocorrelation both in the data and in the residuals?

If we limit our worries to the residuals, it looks like we have a varietyof tests for lag=1:


  stats::cor.test(residuals(fm)[-n], residuals(fm)[-1])
  stats::Box.test(residuals(fm))
  lmtest::dwtest(fm, alternative="two.sided")
  lmtest::bgtest(fm, type="F")

In my model, a simple lm(y~x1+x2) with n=20 annual measurements, I havesignificant _positive_ autocorrelation within Y and within both X vectors,but _negative_ autocorrelation in the residuals. The residualautocorrelation is not quite significant, with the p-values

from the tests above. I seem to remember some authors saying that theDurbin-Watson test has less power than some alternative tests, asreflected here. The difference in p-values is substantial, so choosingwhich test to use could in many cases make a big difference for thesubsequent analysis and conclusions. Most of them (cor.test, Box.test,bgtest) can also test lags>1. Which test would you recommend? I imaginethe basic cor.test is somehow inappropriate for this; the other testswouldn't have been invented otherwise, right?

The car::dwt(fm) has p-values fluctuating by a factor of 2, unless I run avery long simulation, which results in a p-value similar tolmtest::dwtest, at least in my case.

Finally, one question regarding remedies. If there was significant_positive_ autocorrelation in the residuals, some authors suggestremedying this by deflating the df (fewer effective df in the data) andredo the t-tests of the regression coefficients, rejecting fewer nullhypotheses. Does that mean if the residuals are _negatively_ correlatedthen I should inflate the df (more effective df in the data) and rejectmore null hypotheses?

That's four question marks. I'd greatly appreciate guidance on any ofthem.


Thanks in advance,

Arni

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Autocorrelation in linear models

Reply via email to