A colleague, Carl Moons, and I are doing some simulations to study properties 
of multiple imputation using the MICE S-Plus library and using the S-Plus 
transcan function.  We are are getting some unexpected findings for which we 
would appreciate some of your insight.

First, our mechanism for setting some of the observations for one of the 
variables to missing in the simulations is to, with a constant probability, set 
a value to missing if the values of three other binary variables on that 
observation are simultaneously zero.  The imputation model is an additive model 
that does include these three other variables.  The ultimate Y model is also 
additive.  What do you expect to find if the mechanism for missingness is 
non-additive but the imputation and outcome models are additive?

Second, with the above missingness mechanism we find that multiple imputation 
using the response variable results in significantly less bias in that model's 
regression coefficient estimates as expected, but this improvement in bias is 
more than offset by an increase in variance.  Multiple imputations ignoring Y 
resulted in lower MSE of regression coefficient estimates.  Have other people 
found the same result?

Thanks in advance for any thoughts you have.  -Frank
-- 
Frank E Harrell Jr              Prof. of Biostatistics & Statistics
Div. of Biostatistics & Epidem. Dept. of Health Evaluation Sciences
U. Virginia School of Medicine  http://hesweb1.med.virginia.edu/biostat

Reply via email to