Table of Contents
Fetching ...

Rao-Blackwellized e-variables

Dante de Roos, Ben Chugg, Peter Grünwald, Aaditya Ramdas

TL;DR

The paper establishes a general Rao-Blackwellization principle for e-statistics, showing that conditioning an e-variable (or its sequential analogues) on a sufficient statistic can only improve, under any concave utility, the expected value across the parameter space. It extends the result to log-utility with weaker existence conditions, to e-processes via sufficient filtrations, and to compound and asymptotic e-variables, providing a versatile theoretical tool. Concrete applications demonstrate how the approach recovers optimal e-variables in fixed-design linear regression and Pareto data, and how permutation-based averaging can enhance GRO/GROW properties for i.i.d. data. Collectively, the results offer a unifying framework to design and analyze powerful e-statistics across static and sequential testing contexts.

Abstract

We show that for any concave utility, the expected utility of an e-variable can only increase after conditioning on a sufficient statistic. The simplest form of the result has an extremely straightforward proof, which follows from a single application of Jensen's inequality. Similar statements hold for compound e-variables, asymptotic e-variables, and e-processes. These results echo the Rao-Blackwell theorem, which states that the expected squared error of an estimator can only decrease after conditioning on a sufficient statistic. We provide several applications of this insight, including a simplified derivation of the log-optimal e-variable for linear regression with known variance.

Rao-Blackwellized e-variables

TL;DR

The paper establishes a general Rao-Blackwellization principle for e-statistics, showing that conditioning an e-variable (or its sequential analogues) on a sufficient statistic can only improve, under any concave utility, the expected value across the parameter space. It extends the result to log-utility with weaker existence conditions, to e-processes via sufficient filtrations, and to compound and asymptotic e-variables, providing a versatile theoretical tool. Concrete applications demonstrate how the approach recovers optimal e-variables in fixed-design linear regression and Pareto data, and how permutation-based averaging can enhance GRO/GROW properties for i.i.d. data. Collectively, the results offer a unifying framework to design and analyze powerful e-statistics across static and sequential testing contexts.

Abstract

We show that for any concave utility, the expected utility of an e-variable can only increase after conditioning on a sufficient statistic. The simplest form of the result has an extremely straightforward proof, which follows from a single application of Jensen's inequality. Similar statements hold for compound e-variables, asymptotic e-variables, and e-processes. These results echo the Rao-Blackwell theorem, which states that the expected squared error of an estimator can only decrease after conditioning on a sufficient statistic. We provide several applications of this insight, including a simplified derivation of the log-optimal e-variable for linear regression with known variance.

Paper Structure

This paper contains 14 sections, 7 theorems, 55 equations.

Key Result

Theorem 2.1

Let $S$ be a sufficient statistic for $\Theta$. Let $E$ be an e-variable for $\Theta_0$ and set $G = \mathbb{E}_{\Theta}[E\mid S]$. Then $G$ is an e-variable and $\mathbb{E}_{\theta}[f(G)] \geq \mathbb{E}_{\theta}[f(E)]$ for any concave utility function $f$ and any $\theta\in\Theta$, assuming both s

Theorems & Definitions (19)

  • Theorem 2.1
  • proof
  • Remark 2.2
  • Remark 2.3
  • Example 2.4
  • Lemma 2.5
  • proof
  • Remark 2.6
  • Example 2.7
  • Theorem 2.8
  • ...and 9 more