Stronger Neyman Regret Guarantees for Adaptive Experimental Design
Georgy Noarov, Riccardo Fogliato, Martin Bertran, Aaron Roth
TL;DR
This work tackles efficient adaptive sequential experimentation for unbiased ATE estimation in a finite-population design-based framework by introducing Neyman regret as a central performance metric. It develops ClipOGD$^{\mathrm{SC}}$, an enhanced noncontextual adaptive design that achieves $\widetilde{O}(\log T)$ Neyman regret through strong convexity and cost-sensitive clipping, and provides valid confidence intervals for the adaptive IPW estimator. Extending to contextual settings, the authors propose MGATE, a multigroup adaptive design that attains $\widetilde{O}(\sqrt{T})$ multigroup Neyman regret across potentially overlapping groups using a sleeping-experts meta-design. Empirical results on synthetic data, microfinance, and LLM benchmarking corroborate the theoretical gains, demonstrating faster convergence to Neyman-optimal treatment probabilities and lower group-specific regret, with practical validity for ATE and GATE-style inference in adaptive experiments.
Abstract
We study the design of adaptive, sequential experiments for unbiased average treatment effect (ATE) estimation in the design-based potential outcomes setting. Our goal is to develop adaptive designs offering sublinear Neyman regret, meaning their efficiency must approach that of the hindsight-optimal nonadaptive design. Recent work [Dai et al, 2023] introduced ClipOGD, the first method achieving $\widetilde{O}(\sqrt{T})$ expected Neyman regret under mild conditions. In this work, we propose adaptive designs with substantially stronger Neyman regret guarantees. In particular, we modify ClipOGD to obtain anytime $\widetilde{O}(\log T)$ Neyman regret under natural boundedness assumptions. Further, in the setting where experimental units have pre-treatment covariates, we introduce and study a class of contextual "multigroup" Neyman regret guarantees: Given any set of possibly overlapping groups based on the covariates, the adaptive design outperforms each group's best non-adaptive designs. In particular, we develop a contextual adaptive design with $\widetilde{O}(\sqrt{T})$ anytime multigroup Neyman regret. We empirically validate the proposed designs through an array of experiments.
