In one set of simulation experiments I am finding that the Rubin 
variance-covariance formula works very well for regression imputation 
but that the standard error of the final regression coefficient for a 
frequently missing target variable is very much underestimated if PMM is 
used.  Does anyone have experience with this or know of a pertinent 
reference?  In doing PMM I have used both the closest match as part of 
the random-draw multiple imputation algorithm, and I have also tried 
weighted sampling where the closest match has the highest probability of 
being selected but donors around the closest may be selected with 
decreasing probability as they are farther away from the closest match. 
  Missingness of the target variable is moderately strongly related to 
observed values of another covariate (that has no missings).

Thanks
-- 
Frank E Harrell Jr   Professor and Chair           School of Medicine
                      Department of Biostatistics   Vanderbilt University

Reply via email to