Efficient Adaptive Experimentation with Noncompliance
Miruna Oprescu, Brian M Cho, Nathan Kallus
TL;DR
This work develops AMRIV, an Adaptive Multiply Robust estimator for estimating the population ATE in sequential experiments where treatment uptake is endogenous and only a binary instrument can be assigned. The authors derive the semiparametric efficiency bound under history-dependent instrument policies, identify the variance-aware optimal allocation π^*(X), and couple online policy learning with a sequential EIF-based estimator that attains the bound while maintaining multiply robust consistency. They prove asymptotic normality and convergence rates, along with time-uniform confidence sequences for sequential stopping. Empirical studies on synthetic and semi-synthetic data show that adaptive instrument assignment paired with AMRIV improves efficiency and robustness relative to baselines, enabling more reliable and timely causal inference in adaptive IV settings.
Abstract
We study the problem of estimating the average treatment effect (ATE) in adaptive experiments where treatment can only be encouraged -- rather than directly assigned -- via a binary instrumental variable. Building on semiparametric efficiency theory, we derive the efficiency bound for ATE estimation under arbitrary, history-dependent instrument-assignment policies, and show it is minimized by a variance-aware allocation rule that balances outcome noise and compliance variability. Leveraging this insight, we introduce AMRIV -- an Adaptive, Multiply-Robust estimator for Instrumental-Variable settings with variance-optimal assignment. AMRIV pairs (i) an online policy that adaptively approximates the optimal allocation with (ii) a sequential, influence-function-based estimator that attains the semiparametric efficiency bound while retaining multiply-robust consistency. We establish asymptotic normality, explicit convergence rates, and anytime-valid asymptotic confidence sequences that enable sequential inference. Finally, we demonstrate the practical effectiveness of our approach through empirical studies, showing that adaptive instrument assignment, when combined with the AMRIV estimator, yields improved efficiency and robustness compared to existing baselines.
