Optimal Conditional Inference in Adaptive Experiments

Jiafeng Chen; Isaiah Andrews

TL;DR

This work addresses inference in batched adaptive experiments by conditioning on the realized experimental design. It shows that, absent restrictions on the design, inference using only the last batch is optimal. For location-invariant designs, a leftover statistic $L$ from earlier batches carries additional information, which improves conditional inference and leads to a GLS-type estimator for $\eta'\mu$. When the design is further restricted to depend on the data only through polyhedral events, the authors derive computationally tractable, optimal conditional procedures using a truncated-normal framework. Uniform-asymptotic theory and simulation evidence support the results, showing substantially shorter intervals while preserving conditional validity. Together, these results support reliable inference in adaptive experiments and offer practical guidance for pilot studies and polyhedral algorithm settings.

Abstract

We study batched bandit experiments and consider the problem of inference conditional on the realized stopping time, assignment probabilities, and target parameter, where all of these may be chosen adaptively using information up to the last batch of the experiment. Absent further restrictions on the experiment, we show that inference using only the results of the last batch is optimal. When the adaptive aspects of the experiment are known to be location-invariant, in the sense that they are unchanged when we shift all batch-arm means by a constant, we show that there is additional information in the data, captured by one additional linear function of the batch-arm means. In the more restrictive case where the stopping time, assignment probabilities, and target parameter are known to depend on the data only through a collection of polyhedral events, we derive computationally tractable and optimal conditional inference procedures.
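To make the polyhedral case concrete, the following is a minimal sketch of the standard truncated-normal calculation from the selective-inference literature. It is not necessarily the procedure derived in the paper. It assumes a Gaussian vector $z$ with known covariance `Sigma`, a conditioning event of the form $\{Az \le b\}$, and a fixed target direction `eta`; the function names `truncation_interval` and `truncated_cdf` are hypothetical.

```python
import math
import numpy as np

def truncation_interval(z, eta, Sigma, A, b):
    """Return (t, var, lo, hi): the statistic t = eta'z, its variance, and the
    interval [lo, hi] that the event {A z <= b} implies for t once the part of
    z that is uncorrelated with eta'z is held fixed."""
    z, eta, b = (np.asarray(v, dtype=float) for v in (z, eta, b))
    A = np.asarray(A, dtype=float)
    Sigma = np.asarray(Sigma, dtype=float)
    var = float(eta @ Sigma @ eta)
    c = Sigma @ eta / var        # direction in which z moves as eta'z changes
    t = float(eta @ z)
    resid = z - c * t            # uncorrelated with eta'z (independent if Gaussian)
    slope = A @ c
    slack = b - A @ resid
    # Each row of the event reads slope_j * t <= slack_j.
    with np.errstate(divide="ignore", invalid="ignore"):
        ratio = slack / slope
    lo = float(np.max(ratio[slope < 0], initial=-np.inf))
    hi = float(np.min(ratio[slope > 0], initial=np.inf))
    return t, var, lo, hi

def normal_cdf(x):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def truncated_cdf(t, mean, sd, lo, hi):
    """CDF at t of a N(mean, sd^2) variable truncated to [lo, hi]."""
    f_lo = normal_cdf((lo - mean) / sd)
    f_hi = normal_cdf((hi - mean) / sd)
    return (normal_cdf((t - mean) / sd) - f_lo) / (f_hi - f_lo)

# Illustrative example: two arms, and the target "arm 1 minus arm 2" is
# reported only when arm 1 has the larger sample mean (z[0] >= z[1]).
t, var, lo, hi = truncation_interval(
    z=[1.0, 0.0], eta=[1.0, -1.0], Sigma=np.eye(2), A=[[-1.0, 1.0]], b=[0.0]
)
p_value = 1.0 - truncated_cdf(t, mean=0.0, sd=math.sqrt(var), lo=lo, hi=hi)
```

Inverting `truncated_cdf` in its `mean` argument would give a conditional confidence interval for $\eta'\mu$. The ratio of normal CDF differences can be numerically unstable far in the tails, so a production implementation would likely work with log survival functions instead.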

Paper Structure (26 sections, 24 theorems, 191 equations, 2 figures, 2 tables)

Introduction
Problem setup
Inference problem
Conditional inference for location-invariant algorithms
Gain relative to last-batch inference
Conditioning on $\Delta X_{1:T-1}$ in Thompson sampling
Uniform asymptotics for location-invariant algorithms
General conditional inference
Conditional inference for polyhedral algorithms
Simulation evidence
Conclusion
Proofs for results in the main text
Improvability
Leftover information $L$
Sufficiency in Thompson sampling
...and 11 more sections

Key Result

lemma 1

Suppose eq:conditional_requirement holds for all measurable $\eta(\cdot)$, then for all fixed $\eta \in \mathbb{R}^K$

Figures (2)

Figure 1: Distribution of $t$-statistics for adaptive target in the Thompson sampling experiment
Figure 2: Conditional coverage in $\varepsilon$-greedy experiment

Theorems & Definitions (53)

lemma 1
theorem 1
theorem 2
theorem 3
proof : Notes
proof : Notes
proof : Notes
proof : Notes
lemma 1
proof
...and 43 more
