Deep Doubly Debiased Longitudinal Effect Estimation with ICE G-Computation
Wenxin Chen, Weishen Pan, Kyra Gan, Fei Wang
TL;DR
This work tackles the difficulty of estimating longitudinal treatment effects under time-varying confounding and treatment–confounder feedback by introducing D3-Net, a two-stage debiasing framework. During training, SDR-based targets are used to stabilize the recursive ICE G-computation with a multi-task Transformer that also includes a covariate-simulator head and a target network. For inference, the model discards SDR corrections and applies LTMLE to the original nuisance models, achieving robust finite-sample properties. Across semi-synthetic MIMIC-based experiments, D3-Net consistently reduces bias and variance across horizons and confounding regimes, with ablations confirming the central role of SDR-based training and the stabilizing benefit of LTMLE re-debiasing. The approach supports robust evaluation of longitudinal policies and could improve decision-making in sequential treatment settings while highlighting the importance of combining debiasing stages in deep causal estimators.
Abstract
Estimating longitudinal treatment effects is essential for sequential decision-making but is challenging due to treatment-confounder feedback. While Iterative Conditional Expectation (ICE) G-computation offers a principled approach, its recursive structure suffers from error propagation, corrupting the learned outcome regression models. We propose D3-Net, a framework that mitigates error propagation in ICE training and then applies a robust final correction. First, to interrupt error propagation during learning, we train the ICE sequence using Sequential Doubly Robust (SDR) pseudo-outcomes, which provide bias-corrected targets for each regression. Second, we employ a multi-task Transformer with a covariate simulator head for auxiliary supervision, regularizing representations against corruption by noisy pseudo-outcomes, and a target network to stabilize training dynamics. For the final estimate, we discard the SDR correction and instead use the uncorrected nuisance models to perform Longitudinal Targeted Minimum Loss-Based Estimation (LTMLE) on the original outcomes. This second-stage, targeted debiasing ensures robustness and optimal finite-sample properties. Comprehensive experiments demonstrate that our model, D3-Net, robustly reduces bias and variance across different horizons, counterfactuals, and time-varying confoundings, compared to existing state-of-the-art ICE-based estimators.
