Graph Machine Learning based Doubly Robust Estimator for Network Causal Effects
Seyedeh Baharan Khatami, Harsh Parikh, Haowei Chen, Sudeepa Roy, Babak Salimi
TL;DR
The paper tackles causal inference in social networks under interference and high-dimensional confounding by marrying graph neural networks with a doubly robust, semi-parametric inference framework. It develops a GDML approach that learns nuisance functions with GNNs and uses a Neyman orthogonal score within a cross-fitted focal-set procedure to estimate the average direct effect and average peer effect with valid confidence intervals. The authors prove consistency and asymptotic normality under regularity conditions and demonstrate scalability and robustness through semi-synthetic benchmarks on Cora, PubMed, Flickr, and a real Self-Help Group case in India. They also discuss identifiability, provide detailed theoretical derivations, and offer extensive complementary experiments to validate the framework and its generality across nuisance estimators and graph densities.
Abstract
We address the challenge of inferring causal effects in social network data. This results in challenges due to interference -- where a unit's outcome is affected by neighbors' treatments -- and network-induced confounding factors. While there is extensive literature focusing on estimating causal effects in social network setups, a majority of them make prior assumptions about the form of network-induced confounding mechanisms. Such strong assumptions are rarely likely to hold especially in high-dimensional networks. We propose a novel methodology that combines graph machine learning approaches with the double machine learning framework to enable accurate and efficient estimation of direct and peer effects using a single observational social network. We demonstrate the semiparametric efficiency of our proposed estimator under mild regularity conditions, allowing for consistent uncertainty quantification. We demonstrate that our method is accurate, robust, and scalable via an extensive simulation study. We use our method to investigate the impact of Self-Help Group participation on financial risk tolerance.
