Disentangled Graph Autoencoder for Treatment Effect Estimation
Di Fan, Renlei Jiang, Yunhao Wen, Chuanhou Gao
TL;DR
This work tackles the challenge of estimating individual treatment effects (ITE) from networked observational data in the presence of latent confounders. It introduces TNDVGA, a disentangled variational graph autoencoder that partitions latent information into four categories: instrumental ($\mathbf{z}_t$), confounding ($\mathbf{z}_c$), adjustment ($\mathbf{z}_y$), and noise ($\mathbf{z}_o$), and enforces their independence via HSIC. The model jointly optimizes an ELBO-based objective plus specialized losses for treatment and outcome prediction, independence regularization, and balanced representation, enabling accurate ITE estimation even when traditional unconfoundedness fails. Extensive experiments on synthetic and semi-synthetic networked datasets demonstrate state-of-the-art performance, highlighting the practical impact of disentangled latent-factor learning combined with network structure for counterfactual inference in domains such as healthcare and policy.
Abstract
Treatment effect estimation from observational data has attracted significant attention across various research fields. However, many widely used methods rely on the unconfoundedness assumption, which is often unrealistic due to the inability to observe all confounders, thereby overlooking the influence of latent confounders. To address this limitation, recent approaches have utilized auxiliary network information to infer latent confounders, relaxing this assumption. However, these methods often treat observed variables and networks as proxies only for latent confounders, which can result in inaccuracies when certain variables influence treatment without affecting outcomes, or vice versa. This conflation of distinct latent factors undermines the precision of treatment effect estimation. To overcome this challenge, we propose a novel disentangled variational graph autoencoder for treatment effect estimation on networked observational data. Our graph encoder disentangles latent factors into instrumental, confounding, adjustment, and noisy factors, while enforcing factor independence using the Hilbert-Schmidt Independence Criterion. Extensive experiments on multiple networked datasets demonstrate that our method outperforms state-of-the-art approaches.
