Table of Contents
Fetching ...

Optimal e-value testing for properly constrained hypotheses

Eugenio Clerico

TL;DR

The paper presents a rigorous characterization of the optimal e-class for hypothesis testing via e-values under properly constrained, non-parametric hypotheses defined by finitely many regular constraints. It proves that the optimal e-class coincides with the dual e-class $\mathcal{E}_\mathcal{H}^\vee$ and establishes existence in finite domains, extends to compact and general closed domains via matching-set techniques, and extends to non-properly and loosely constrained cases. The results are directly applicable to constructing tight confidence sequences for means, including bounded and heavy-tailed settings, by restricting to the optimal e-class. This work thus provides a principled, tractable framework for designing sequential tests with e-values, with practical impact on adaptive data collection and mean-estimation tasks while connecting to classical admissibility concepts and laying groundwork for broader sequential testing beyond single-round e-variables.

Abstract

Hypothesis testing via e-variables can be framed as a sequential betting game, where a player each round picks an e-variable. A good player's strategy results in an effective statistical test that rejects the null hypothesis as soon as sufficient evidence arises. Building on recent advances, we address the question of restricting the pool of e-variables to simplify strategy design without compromising effectiveness. We extend the results of Clerico(2024), by characterising optimal sets of e-variables for a broad class of non-parametric hypothesis tests, defined by finitely many regular constraints. As an application, we discuss this notion of optimality in algorithmic mean estimation, including for heavy-tailed random variables.

Optimal e-value testing for properly constrained hypotheses

TL;DR

The paper presents a rigorous characterization of the optimal e-class for hypothesis testing via e-values under properly constrained, non-parametric hypotheses defined by finitely many regular constraints. It proves that the optimal e-class coincides with the dual e-class and establishes existence in finite domains, extends to compact and general closed domains via matching-set techniques, and extends to non-properly and loosely constrained cases. The results are directly applicable to constructing tight confidence sequences for means, including bounded and heavy-tailed settings, by restricting to the optimal e-class. This work thus provides a principled, tractable framework for designing sequential tests with e-values, with practical impact on adaptive data collection and mean-estimation tasks while connecting to classical admissibility concepts and laying groundwork for broader sequential testing beyond single-round e-variables.

Abstract

Hypothesis testing via e-variables can be framed as a sequential betting game, where a player each round picks an e-variable. A good player's strategy results in an effective statistical test that rejects the null hypothesis as soon as sufficient evidence arises. Building on recent advances, we address the question of restricting the pool of e-variables to simplify strategy design without compromising effectiveness. We extend the results of Clerico(2024), by characterising optimal sets of e-variables for a broad class of non-parametric hypothesis tests, defined by finitely many regular constraints. As an application, we discuss this notion of optimality in algorithmic mean estimation, including for heavy-tailed random variables.
Paper Structure (22 sections, 38 theorems, 31 equations)

This paper contains 22 sections, 38 theorems, 31 equations.

Key Result

Proposition 1

Let $\mathcal{H}$ be a hypothesis on $\mathcal{X}$ and consider a sequence $(X_t)_{t\geq 1}\subseteq\mathcal{X}$ of independent draws from some $P\in\mathcal{H}$. Fix $\delta\in(0,1)$ and $\mathcal{E} \subseteq \mathcal{E}_\mathcal{H}$. Consider an $\mathcal{E}$-restricted testing game on $\mathcal{

Theorems & Definitions (83)

  • Definition 1: Hypotheses and e-variables
  • Definition 2: Testing game
  • Proposition 1
  • Definition 3: Majorising and optimal e-classes
  • Definition 4: Maximal e-variables
  • Lemma 1
  • Corollary 1
  • proof
  • Lemma 2
  • proof
  • ...and 73 more