Graph Disentangle Causal Model: Enhancing Causal Inference in Networked Observational Data
Binbin Hu, Zhicheng An, Zhengwei Wu, Ke Tu, Ziqi Liu, Zhiqiang Zhang, Jun Zhou, Yufei Feng, Jiawei Chen
TL;DR
The paper tackles estimating individualized treatment effects in networked observational data where hidden confounders exist. It introduces the Graph Disentangle Causal Model (GDC), a three-part framework with a causal disentangle module, a three-path graph aggregation module, and a causal constraint module to learn true adjustment and confounder factors and their counterfactuals. By separating features into adjustment and confounding components and aggregating them with treatment-aware graph attention, GDC yields more accurate factual and counterfactual predictions, validated on semi-synthetic BlogCatalog and Flickr datasets where it outperforms baselines. This approach offers a principled way to mitigate confounding bias in graphs while preserving predictive power, with implications for personalized treatment and recommendations in networked domains.
Abstract
Estimating individual treatment effects (ITE) from observational data is a critical task across various domains. However, many existing works on ITE estimation overlook the influence of hidden confounders, which remain unobserved at the individual unit level. To address this limitation, researchers have utilized graph neural networks to aggregate neighbors' features to capture the hidden confounders and mitigate confounding bias by minimizing the discrepancy of confounder representations between the treated and control groups. Despite the success of these approaches, practical scenarios often treat all features as confounders and involve substantial differences in feature distributions between the treated and control groups. Confusing the adjustment and confounder and enforcing strict balance on the confounder representations could potentially undermine the effectiveness of outcome prediction. To mitigate this issue, we propose a novel framework called the \textit{Graph Disentangle Causal model} (GDC) to conduct ITE estimation in the network setting. GDC utilizes a causal disentangle module to separate unit features into adjustment and confounder representations. Then we design a graph aggregation module consisting of three distinct graph aggregators to obtain adjustment, confounder, and counterfactual confounder representations. Finally, a causal constraint module is employed to enforce the disentangled representations as true causal factors. The effectiveness of our proposed method is demonstrated by conducting comprehensive experiments on two networked datasets.
