Optimal e-value testing for properly constrained hypotheses

Eugenio Clerico

TL;DR

The paper gives a rigorous characterization of the optimal e-class for hypothesis testing via e-values under properly constrained, non-parametric hypotheses defined by finitely many regular constraints. It proves that the optimal e-class coincides with the dual e-class $\mathcal{E}_\mathcal{H}^\vee$, establishes existence in finite domains, extends the result to compact and general closed domains via matching-set techniques, and further covers non-properly and loosely constrained hypotheses. The results apply directly to constructing tight confidence sequences for means, in both bounded and heavy-tailed settings, by restricting to the optimal e-class. The work thus provides a principled, tractable framework for designing sequential tests with e-values, with practical relevance for adaptive data collection and mean estimation. It also connects to classical admissibility concepts and lays groundwork for sequential testing beyond single-round e-variables.

Abstract

Hypothesis testing via e-variables can be framed as a sequential betting game in which a player picks an e-variable each round. A good strategy yields an effective statistical test that rejects the null hypothesis as soon as sufficient evidence arises. Building on recent advances, we address the question of restricting the pool of e-variables to simplify strategy design without compromising effectiveness. We extend the results of Clerico (2024) by characterising optimal sets of e-variables for a broad class of non-parametric hypothesis tests defined by finitely many regular constraints. As an application, we discuss this notion of optimality in algorithmic mean estimation, including for heavy-tailed random variables.
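To make the betting-game picture concrete, here is a minimal Python sketch for one simple case: testing that a random variable on [0, 1] has mean `mu0`. It is an illustration under stated assumptions, not the paper's construction. The function name `betting_test`, the fixed bet size `lam`, and the e-variable family $E_\lambda(x) = 1 + \lambda(x - \mu_0)$ are choices made here; that family is the standard one for bounded means, and a real strategy would typically adapt $\lambda$ from round to round. The player's wealth is the running product of the e-variables, and the test rejects once the wealth reaches $1/\delta$.

```python
def betting_test(samples, mu0, delta=0.05, lam=0.5):
    """Sequential test of H: E[X] = mu0 for X in [0, 1], played as a betting game.

    Hypothetical helper for illustration. Each round the player uses the
    e-variable E(x) = 1 + lam * (x - mu0), which has expectation 1 under H.
    Returns (rejection_round or None, final_wealth).
    """
    if not 0.0 < mu0 < 1.0:
        raise ValueError("mu0 must lie strictly between 0 and 1")
    if not 0.0 < delta < 1.0:
        raise ValueError("delta must lie strictly between 0 and 1")
    # lam must keep E(x) non-negative for every x in [0, 1].
    lo, hi = -1.0 / (1.0 - mu0), 1.0 / mu0
    if not lo <= lam <= hi:
        raise ValueError(f"lam must lie in [{lo}, {hi}]")

    wealth = 1.0  # the player starts with unit wealth
    for t, x in enumerate(samples, start=1):
        wealth *= 1.0 + lam * (x - mu0)  # multiply by this round's e-value
        if wealth >= 1.0 / delta:        # enough evidence: reject H
            return t, wealth
    return None, wealth                  # never rejected on this sample


if __name__ == "__main__":
    # Data that sit well above mu0 make the wealth grow geometrically.
    print(betting_test([1.0] * 20, mu0=0.5, delta=0.05, lam=1.0))
```

Keeping `lam` fixed keeps the sketch short; the design question the abstract raises is which pool of e-variables the player may restrict to without losing power, which this toy example does not address.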

Paper Structure (22 sections, 38 theorems, 31 equations)

Introduction
Structure
Notation
Algorithmic hypothesis testing
Majorising e-classes and optimal e-class
Properly constrained hypotheses
Dual e-class
Domains with finite cardinality
Matching sets
Optimality of the dual e-class
Extensions
Testing finitely (non-properly) constrained hypotheses
Loosely constrained hypotheses
Algorithmic mean estimation
Mean estimation for bounded random variables
...and 7 more sections

Key Result

Proposition 1

Let $\mathcal{H}$ be a hypothesis on $\mathcal{X}$ and consider a sequence $(X_t)_{t\geq 1}\subseteq\mathcal{X}$ of independent draws from some $P\in\mathcal{H}$. Fix $\delta\in(0,1)$ and $\mathcal{E} \subseteq \mathcal{E}_\mathcal{H}$. Consider an $\mathcal{E}$-restricted testing game on $\mathcal{

Theorems & Definitions (83)

Definition 1: Hypotheses and e-variables
Definition 2: Testing game
Proposition 1
Definition 3: Majorising and optimal e-classes
Definition 4: Maximal e-variables
Lemma 1
Corollary 1
proof
Lemma 2
proof
...and 73 more
