Disentangled Instrumental Variables for Causal Inference with Networked Observational Data
Zhirong Huang, Debo Cheng, Guixian Zhang, Yi Wang, Jiuyong Li, Shichao Zhang
TL;DR
This work tackles causal effect estimation in networked observational data plagued by unobserved confounding. It introduces DisIV, a disentangled instrumental variable framework that uses network homogeneity as an inductive bias to isolate individual-specific latent IVs through structural disentanglement, enforcing exogeneity via orthogonality and exclusion constraints. The method employs an asymmetric inference–generation architecture with a confounder proxy learned from graph structure and a latent IV learned from observation features, optimized through a two-stage procedure that separates IV recovery from outcome modeling. Empirical results on semi-synthetic BlogCatalog and Flickr datasets show that DisIV consistently outperforms state-of-the-art baselines, with ablation and disentanglement analyses confirming the necessity and validity of the latent IVs for unbiased ITE and ATE estimation. The approach offers a scalable, principled path to causal inference in networked settings where environmental confounding and neighbor-induced endogeneity are entangled with individual-specific signals.
Abstract
Instrumental variables (IVs) are crucial for addressing unobservable confounders, yet their stringent exogeneity assumptions pose significant challenges in networked data. Existing methods typically rely on modelling neighbour information when recovering IVs, thereby inevitably mixing shared environment-induced endogenous correlations and individual-specific exogenous variation, leading the resulting IVs to inherit dependence on unobserved confounders and to violate exogeneity. To overcome this challenge, we propose $\underline{Dis}$entangled $\underline{I}$nstrumental $\underline{V}$ariables (DisIV) framework, a novel method for causal inference based on networked observational data with latent confounders. DisIV exploits network homogeneity as an inductive bias and employs a structural disentanglement mechanism to extract individual-specific components that serve as latent IVs. The causal validity of the extracted IVs is constrained through explicit orthogonality and exclusion conditions. Extensive semi-synthetic experiments on real-world datasets demonstrate that DisIV consistently outperforms state-of-the-art baselines in causal effect estimation under network-induced confounding.
