Differences-in-Neighbors for Network Interference in Experiments
Tianyi Peng, Naimeng Ye, Andrew Zheng
TL;DR
The paper tackles network interference in experiments by proposing Differences-in-Neighbors (DN), an estimator that reduces bias from interference while keeping variance tractable, especially when used with clustering. DN achieves second-order bias in the interference strength and variance that grows polynomially with network degree, offering a superior bias-variance tradeoff compared to naive DM and HT estimators. The authors develop a cluster-aware DN variant, prove theoretical guarantees including bias and variance bounds via Taylor expansions and smoothness assumptions, and demonstrate practical gains on small-world and real-world networks as well as a ridesharing simulator. The work bridges design and estimation choices, enabling smaller clusters and more accurate ATE estimation under complex interference patterns. Overall, DN extends the toolkit for scalable, robust causal inference in interconnected systems with practical implications for large online platforms.
Abstract
Experiments in online platforms frequently suffer from network interference, in which a treatment applied to a given unit affects outcomes for other units connected via the platform. This SUTVA violation biases naive approaches to experiment design and estimation. A common solution is to reduce interference by clustering connected units, and randomizing treatments at the cluster level, typically followed by estimation using one of two extremes: either a simple difference-in-means (DM) estimator, which ignores remaining interference; or an unbiased Horvitz-Thompson (HT) estimator, which eliminates interference at great cost in variance. Even combined with clustered designs, this presents a limited set of achievable bias variance tradeoffs. We propose a new estimator, dubbed Differences-in-Neighbors (DN), designed explicitly to mitigate network interference. Compared to DM estimators, DN achieves bias second order in the magnitude of the interference effect, while its variance is exponentially smaller than that of HT estimators. When combined with clustered designs, DN offers improved bias-variance tradeoffs not achievable by existing approaches. Empirical evaluations on a large-scale social network and a city-level ride-sharing simulator demonstrate the superior performance of DN in experiments at practical scale.
