Sensitivity analysis for contamination in egocentric-network randomized trials with interference
Bar Weinstein, Daniel Nevo
TL;DR
This paper addresses contamination in egocentric-network randomized trials (ENRTs) operating under interference when full sociocentric data are unavailable. It develops a design-based framework showing that Horvitz-Thompson estimators for the indirect and direct effects are biased under network contamination and introduces bias-corrected estimators that depend on sensitivity parameters for missing edges, including an interaction parameter κ for egos. The authors implement a dual sensitivity-analysis toolkit—grid sensitivity analysis (GSA) and probabilistic bias analysis (PBA)—and illustrate their approach with simulations and the HIV Prevention Trials Network 037 study, revealing that ignoring contamination can misstate both indirect and direct effects. The methods are accompanied by an R package ENRTsensitivity and offer practical guidance for robustness assessment of ENRT causal estimates in settings where complete network data are impractical to obtain.
Abstract
Egocentric-Network Randomized Trials (ENRTs) are increasingly used to estimate causal effects under interference when measuring complete sociocentric network data is infeasible. ENRTs rely on egocentric network sampling, where a set of egos is first sampled, and each ego recruits a subset of its neighbors as alters. Treatments are then randomized across egos. While the observed ego-networks are disjoint by design, the underlying population network may contain edges connecting them, leading to contamination. Under a design-based framework, we show that the Horvitz-Thompson estimators of direct and indirect effects are biased whenever contamination is present. To address this, we derive bias-corrected estimators and propose a novel sensitivity analysis framework based on sensitivity parameters representing the probability or expected number of missing edges. This framework is implemented via both grid sensitivity analysis and probabilistic bias analysis, providing researchers with a flexible tool to assess the robustness of the causal estimators to contamination. We apply our methodology to the HIV Prevention Trials Network 037 study, finding that ignoring contamination may lead to underestimation of indirect effects and overestimation of direct effects.
