Optimal E-Values for Exponential Families: the Simple Case
Peter Grünwald, Tyron Lardy, Yunda Hao, Shaul K. Bar-Lev, Martijn de Jong
TL;DR
This paper derives a general, checkable condition under which simple e-variables, in the form of a likelihood-ratio between a simple alternative $Q$ and a simple null $P_{\mu^*}$, exist for composite exponential-family nulls. By constructing an auxiliary exponential family $\mathcal{Q}$ sharing the same sufficient statistic as the null and analyzing the covariance structure via $\Sigma_p(\bm\mu)$ and $\Sigma_q(\bm\mu)$, the authors show that $q(U)/p_{\mu^*}(U)$ is a globally GRO e-variable whenever $\Sigma_p(\bm\mu)-\Sigma_q(\bm\mu)$ is positive semidefinite for all relevant $\bm\mu$. They establish eight equivalent conditions, including KL-divergence inequalities and log-partition function comparisons, and demonstrate the result across diverse settings: Gaussian location with shared or distinct covariance, Gaussian and Poisson k-sample tests, Bernoulli cases, Gaussian scale, NEFs, and a linear-regression model. The work unifies and extends prior results on simple e-variables, enabling easy computation and anytime-valid testing, and provides a framework for mixture-based composites and sequential testing. Practical impact lies in offering computable, GRO e-variables for a broad class of hypothesis tests in exponential-family models, with clear criteria for when they exist and how to construct them.
Abstract
We provide a general condition under which e-variables in the form of a simple-vs.-simple likelihood ratio exist when the null hypothesis is a composite, multivariate exponential family. Such `simple' e-variables are easy to compute and expected-log-optimal with respect to any stopping time. Simple e-variables were previously only known to exist in quite specific settings, but we offer a unifying theorem on their existence for testing exponential families. We start with a simple alternative $Q$ and a regular exponential family null. Together these induce a second exponential family ${\cal Q}$ containing $Q$, with the same sufficient statistic as the null. Our theorem shows that simple e-variables exist whenever the covariance matrices of ${\cal Q}$ and the null are in a certain relation. A prime example in which this relation holds is testing whether a parameter in a linear regression is 0. Other examples include some $k$-sample tests, Gaussian location- and scale tests, and tests for more general classes of natural exponential families. While in all these examples, the implicit composite alternative is also an exponential family, in general this is not required.
