Causal Effect Estimation using identifiable Variational AutoEncoder with Latent Confounders and Post-Treatment Variables
Yang Xie, Ziqi Xu, Debo Cheng, Jiuyong Li, Lin Liu, Yinghao Zhang, Zaiwen Feng
TL;DR
This work tackles unbiased causal effect estimation from observational data in the presence of latent confounders and post-treatment variables. It introduces CPTiVAE, which jointly learns latent confounders via VAE from proxy $ ext{X}_ ext{C}$ and latent post-treatment variables via identifiable VAE from proxy $ ext{X}_ ext{M}$, incorporating identifiability guarantees for the latent post-treatment representation. The authors prove a theorem establishing $ ext{M}$-identifiability under mild conditions and demonstrate through synthetic, semi-synthetic (IHDP-based), and real-world (Adult) experiments that CPTiVAE outperforms state-of-the-art baselines in estimating $ATE$ and $CATE$, while maintaining robustness to hyperparameters and latent dimension choices. The approach offers a principled framework to adjust for both confounding and post-treatment biases, with practical implications for causal analysis in domains where post-treatment pathways are informative but unobserved. The work also provides public code, underscoring its potential for broad adoption in causal inference tasks that leverage proxy information.
Abstract
Estimating causal effects from observational data is challenging, especially in the presence of latent confounders. Much work has been done on addressing this challenge, but most of the existing research ignores the bias introduced by the post-treatment variables. In this paper, we propose a novel method of joint Variational AutoEncoder (VAE) and identifiable Variational AutoEncoder (iVAE) for learning the representations of latent confounders and latent post-treatment variables from their proxy variables, termed CPTiVAE, to achieve unbiased causal effect estimation from observational data. We further prove the identifiability in terms of the representation of latent post-treatment variables. Extensive experiments on synthetic and semi-synthetic datasets demonstrate that the CPTiVAE outperforms the state-of-the-art methods in the presence of latent confounders and post-treatment variables. We further apply CPTiVAE to a real-world dataset to show its potential application.
