Using the Fraction of Missing Information to Identify Auxiliary Variables for Imputation Procedures via Proxy Pattern‐mixture Models |
| |
Authors: | Rebecca Andridge Katherine Jenny Thompson |
| |
Affiliation: | 1. College of Public Health, The Ohio State University, Columbus, OH, USA;2. Office of Statistical Methods and Research for Economic Programs, U.S. Census Bureau, Washington, DC, USA |
| |
Abstract: | In many surveys, imputation procedures are used to account for non‐response bias induced by either unit non‐response or item non‐response. Such procedures are optimised (in terms of reducing non‐response bias) when the models include covariates that are highly predictive of both response and outcome variables. To achieve this, we propose a method for selecting sets of covariates used in regression imputation models or to determine imputation cells for one or more outcome variables, using the fraction of missing information (FMI) as obtained via a proxy pattern‐mixture (PMM) model as the key metric. In our variable selection approach, we use the PPM model to obtain a maximum likelihood estimate of the FMI for separate sets of candidate imputation models and look for the point at which changes in the FMI level off and further auxiliary variables do not improve the imputation model. We illustrate our proposed approach using empirical data from the Ohio Medicaid Assessment Survey and from the Service Annual Survey. |
| |
Keywords: | Maximum likelihood fraction of missing information imputation proxy pattern‐mixture models |
|
|