Multiple Imputation Methods under Extreme Values
Enzo Porto Brasil
TL;DR
This study evaluates six MI methods from MICE to understand their performance when missing data co-occur with extreme values. Through Monte Carlo simulations on a three-variable normal design, it compares parametric, donor-based, and nonparametric MI approaches under MCAR, with downstream models switching between OLS (clean) and elastic net (contaminated). Key findings show parametric MI often yields tighter predictive tails and better out-of-sample CV-MSE, while donor-based and ML approaches can reduce slope bias at the expense of heavier tails; sample size mitigates dispersion but does not fully erase contamination-induced distortions. The work provides practical guidance: choose MI methods considering missingness, extremes, and downstream modeling goals, and emphasizes examining the missingness mechanism and the presence of extremes before method selection. The results have direct implications for applied research where extremes and MAR/MAR patterns interact with imputation, influencing both predictive performance and inferential validity.
Abstract
Missing data are ubiquitous in empirical databases, yet statistical analyses typically require complete data matrices. Multiple imputation offers a principled solution for filling these gaps. This study evaluates the performance of several multiple imputation methods, both in the presence and absence of extreme values, using the MICE package in R. Through Monte Carlo simulations, we generated incomplete data sets with three variables and assessed each imputation method within regression models. The results indicate that the linear regression based imputation method showed the best overall predictive performance (CV-MSE), whereas the sparse model approach was generally less efficient. Our findings underscore the relevance of extreme values when selecting an imputation strategy and highlight sample size, proportion of missingness, presence of extremes, and the type of fitted model as key determinants of performance. Despite its limitations, the study offers practical recommendations for researchers, stressing the need to examine the missingness mechanism and the occurrence of extreme values before choosing an imputation method.
