Coarsened Propensity Scores and Hybrid Estimators for Missing Data and Causal Inference |
| |
Authors: | Jie Zhou Zhiwei Zhang Zhaohai Li Jun Zhang |
| |
Affiliation: | 1. Division of Biostatistics, Office of Surveillance and Biometrics, Center for Devices and Radiological Health, Food and Drug Administration, Silver Spring, Maryland, USA;2. Department of Statistics, George Washington University, Washington, DC, USA;3. MOE and Shanghai Key Laboratory of Children's Environmental Health, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China |
| |
Abstract: | In the areas of missing data and causal inference, there is great interest in doubly robust (DR) estimators that involve both an outcome regression (RG) model and a propensity score (PS) model. These DR estimators are consistent and asymptotically normal if either model is correctly specified. Despite their theoretical appeal, the practical utility of DR estimators has been disputed (e.g. Kang and Schaffer, Statistical Science 2007; 22: 523–539). One of the major concerns is the possibility of erratic estimates resulting from near‐zero denominators due to extreme values of the estimated PS. In contrast, the usual RG estimator based on the RG model alone is efficient when the RG model is correct and generally more stable than the DR estimators, although it can be biased when the RG model is incorrect. In light of the unique advantages of the RG and DR estimators, we propose a class of hybrid estimators that attempt to strike a reasonable balance between the RG and DR estimators. These hybrid estimators are motivated by heuristic arguments that coarsened PS estimates are less likely to take extreme values and less sensitive to misspecification of the PS model than the original model‐based PS estimates. The proposed estimators are compared with existing estimators in simulation studies and illustrated with real data from a large observational study on obstetric labour progression and birth outcomes. |
| |
Keywords: | Double robustness inverse probability weighting outcome regression propensity score stratification |
|
|