Unbiased Estimation for Total Treatment Effect Under Interference Using Aggregated Dyadic Data
Lu Deng, Yilin Li, JingJing Zhang, Yong Wang, Chuan Chen
TL;DR
This work tackles causal inference under network interference by modeling dyadic interactions and proposing an unbiased estimator for the total treatment effect (TTE) using aggregated dyadic data. By combining two Horvitz–Thompson–based estimators—one on dyadic outcomes $Y_i$ and one on diffusion-like dyadic outcomes $D_i$—into $\hat{\tau} = \hat{\tau}^{1} + \hat{\tau}^{2}$, the authors show that the estimator is unbiased for the full population when treatment is assigned with $\pi=0.5$, and that it exhibits reduced bias in subpopulation and two-stage cluster designs. Through extensive simulations on large social-network graphs and a real-world Weixin experiment, they demonstrate the practical advantages of their approach for estimating the total causal effect in settings with interference. The results offer a scalable, data-structure–driven method for improving causal estimates in online platforms where dyadic interactions drive outcomes.
Abstract
In social media platforms, user behavior is often influenced by interactions with other users, complicating the accurate estimation of causal effects in traditional A/B experiments. This study investigates situations where an individual's outcome can be broken down into the sum of multiple pairwise outcomes, a reflection of user interactions. These outcomes, referred to as dyadic data, are prevalent in many social network contexts. Utilizing a Bernoulli randomized design, we introduce a novel unbiased estimator for the total treatment effect (TTE), which quantifies the difference in population mean when all individuals are assigned to treatment versus control groups. We further explore the bias of our estimator in scenarios where it is impractical to include all individuals in the experiment, a common constraint in online control experiments. Our numerical results reveal that our proposed estimator consistently outperforms some commonly used estimators, underscoring its potential for more precise causal effect estimation in social media environments.
