Minimax and Bayes Optimal Adaptive Experimental Design for Treatment Choice

Masahiro Kato

Minimax and Bayes Optimal Adaptive Experimental Design for Treatment Choice

Masahiro Kato

TL;DR

The paper studies adaptive experiments for choosing between two binary treatments by framing the problem as a fixed-budget best-arm identification with regret. It proposes a two-stage Neyman allocation (TSNA) that first estimates variances with uniform allocation and then allocates samples in proportion to estimated standard deviations, followed by selecting the treatment with the largest sample mean. The authors prove that TSNA is asymptotically minimax and Bayes optimal by deriving tight lower bounds via change-of-measure arguments and showing matching upper bounds using CLT and large deviations. This approach provides a principled, distribution-agnostic allocation rule and connects treatment-choice optimization to efficient ATE estimation within mean-parameterized exponential-family outcome models.

Abstract

We consider an adaptive experiment for treatment choice and design a minimax and Bayes optimal adaptive experiment with respect to regret. Given binary treatments, the experimenter's goal is to choose the treatment with the highest expected outcome through an adaptive experiment, in order to maximize welfare. We consider adaptive experiments that consist of two phases, the treatment allocation phase and the treatment choice phase. The experiment starts with the treatment allocation phase, where the experimenter allocates treatments to experimental subjects to gather observations. During this phase, the experimenter can adaptively update the allocation probabilities using the observations obtained in the experiment. After the allocation phase, the experimenter proceeds to the treatment choice phase, where one of the treatments is selected as the best. For this adaptive experimental procedure, we propose an adaptive experiment that splits the treatment allocation phase into two stages, where we first estimate the standard deviations and then allocate each treatment proportionally to its standard deviation. We show that this experiment, often referred to as Neyman allocation, is minimax and Bayes optimal in the sense that its regret upper bounds exactly match the lower bounds that we derive. To show this optimality, we derive minimax and Bayes lower bounds for the regret using change-of-measure arguments. Then, we evaluate the corresponding upper bounds using the central limit theorem and large deviation bounds.

Minimax and Bayes Optimal Adaptive Experimental Design for Treatment Choice

TL;DR

Abstract

Minimax and Bayes Optimal Adaptive Experimental Design for Treatment Choice

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (20)