Dynamic treatment effects: high-dimensional inference under model misspecification
Yuqian Zhang, Weijie Ji, Jelena Bradic
TL;DR
The paper tackles dynamic treatment effects under time-varying confounding with high-dimensional covariates and potential misspecification. It introduces the sequential model doubly robust (SMDR) estimator, built on moment-targeted nuisance estimates and a suite of losses that enforce orthogonality conditions even when nuisance models are misspecified. The authors prove root-$N$ inference for the dynamic treatment effect parameter $ heta_{1,1}$ under minimal assumptions, and derive nuisance-estimator rates under misspecification and correct specification, along with sparsity conditions. Through simulations and semi-synthetic NJCS analysis, SMDR consistently outperforms competing methods in bias, RMSE, and coverage, demonstrating robustness to misspecification in high dimensions. The work advances high-dimensional causal inference in longitudinal settings and paves the way for extensions to more complex dynamic regimes and dense models.
Abstract
Estimating dynamic treatment effects is crucial across various disciplines, providing insights into the time-dependent causal impact of interventions. However, this estimation poses challenges due to time-varying confounding, leading to potentially biased estimates. Furthermore, accurately specifying the growing number of treatment assignments and outcome models with multiple exposures appears increasingly challenging to accomplish. Double robustness, which permits model misspecification, holds great value in addressing these challenges. This paper introduces a novel "sequential model doubly robust" estimator. We develop novel moment-targeting estimates to account for confounding effects and establish that root-$N$ inference can be achieved as long as at least one nuisance model is correctly specified at each exposure time, despite the presence of high-dimensional covariates. Although the nuisance estimates themselves do not achieve root-$N$ rates, the carefully designed loss functions in our framework ensure final root-$N$ inference for the causal parameter of interest. Unlike off-the-shelf high-dimensional methods, which fail to deliver robust inference under model misspecification even within the doubly robust framework, our newly developed loss functions address this limitation effectively.
