[R] Identify Leverage Points

Noah Silverman Fri, 19 Jul 2013 15:39:31 -0700

Hello,

I'm working on some fairly standard regression models (linear, logistic, and 
poisson.)  Unfortunately, the data is rather messy.


A visual inspection, using either a histogram or a density plot indicates some 
significant outliers.  Furthermore, summary statistics of the data indicate the 
same thing.

If I fit a linear regression in R using the "lm" command, I can then plot the 
model to look at residuals, etc.

I'm interesting in re-fitting the model with a N% of the high leverage points 
removed.   (Large data set, want to fit "most" of the data.)

Is there a computational way to get the leverage for each data point?  That way 
I can subset the data skipping N% of the highest leverage ones.


Thanks!


--
Noah Silverman, M.S., C.Phil
UCLA Department of Statistics
8117 Math Sciences Building
Los Angeles, CA 90095

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] Identify Leverage Points

Reply via email to