Causal Dynamic Variational Autoencoder for Counterfactual Regression in Longitudinal Data
Mouad El Bouchattaoui, Myriam Tami, Benoit Lepetit, Paul-Henry Cournède
TL;DR
The paper tackles causal effect estimation in longitudinal settings with unobserved static adjustment variables by introducing CDVAE, a causal dynamic variational autoencoder that learns latent substitutes to augment ACATE estimation under sequential ignorability. It establishes identifiability of the augmented treatment effect using a finite-order conditional Markov model and derives a generalization bound linking representation learning, covariate balance, and model loss. The near-deterministic analysis shows that as the decoder variance vanishes, causal estimates become realization-invariant and consistent. Empirical evaluations on synthetic and semi-synthetic (MIMIC-III) data demonstrate that CDVAE outperforms baselines and that augmenting existing models with CDVAE substitutes yields near-oracle performance, highlighting the practical impact for personalized decision-making in medicine and beyond.
Abstract
Accurately estimating treatment effects over time is crucial in fields such as precision medicine, epidemiology, economics, and marketing. Many current methods for estimating treatment effects over time assume that all confounders are observed or attempt to infer unobserved ones. In contrast, our approach focuses on unobserved adjustment variables, which specifically have a causal effect on the outcome sequence. Under the assumption of unconfoundedness, we address estimating Conditional Average Treatment Effects (CATEs) while accounting for unobserved heterogeneity in response to treatment due to these unobserved adjustment variables. Our proposed Causal Dynamic Variational Autoencoder (CDVAE) is grounded in theoretical guarantees concerning the validity of latent adjustment variables and generalization bounds on CATE estimation error. Extensive evaluations on synthetic and real-world datasets show that CDVAE outperforms existing baselines. Moreover, we demonstrate that state-of-the-art models significantly improve their CATE estimates when augmented with the latent substitutes learned by CDVAE, approaching oracle-level performance without direct access to the true adjustment variables.
