Bias-Targeted Nonparametric Balancing for Stable Causal Mediation Analysis
Chang Liu, AmirEmad Ghassami
TL;DR
This paper tackles the instability of influence-function-based causal mediation estimators in the presence of continuous mediators by introducing a bias-targeted nuisance estimation framework. It reparametrizes the likelihood to avoid direct mediator-density estimation and employs nonparametric minimax balancing within RKHS to stabilize nuisance components, achieving consistency, multiple robustness, and asymptotic normality. The authors develop a two-stage nuisance estimation procedure for the single-mediator case and extend it to multiple mediators, including kernel-based implementations and theory for bias control. Through extensive simulations and an NHANES application, they demonstrate reduced bias and variance, improved inference, and substantive insights into obesity-related mediation of coronary heart disease via HbA1c.
Abstract
Influence function (IF)-based estimators are widely used in mediation analysis due to their modeling flexibility, but standard implementations require direct estimation of the distribution functions of the mediator and treatment variables. Since these functions appear in the denominator of IF-based estimators, they can induce significant instability, particularly with continuous mediators. In this work, we propose an alternative implementation of IF-based estimators for both single- and multiple-mediator settings, based on reparametrizations of the likelihood. The key idea is to estimate the involved nuisance functions according to their role in the bias structure of the IF-based estimators. In our approach, key nuisance functions that are potential sources of instability are estimated using a novel nonparametric weighted balancing method-which can be viewed as a nonparametric extension of covariate balancing generalized to mediation analysis-fully stabilizing the estimators. We establish consistency and multiple robustness under suitable regularity conditions, and asymptotic normality. Simulation studies demonstrate substantial reductions in bias and variance relative to existing methods for continuous mediators. We further illustrate the approach using NHANES 2013-2014 data to estimate the effect of obesity on coronary heart disease mediated by Glycohemoglobin.
