A distribution-free valid p-value for finite samples of bounded random variables
Joaquin Alvarez
TL;DR
The paper develops distribution-free, valid p-values (super-uniform) for testing whether the mean loss $R=\mathbb{E}[L_i]$ exceeds a threshold $\alpha$, using only the boundedness $L_i\in[0,1]$. It builds on a Bernstein-Hoeffding-type concentration bound from Pelekis, Ramon and Wang to define a function $g(t;R)$ and a derived $\gamma(R)$ that yield a PRW valid p-value $p_{\text{PRW}} = g(\min\{\hat{R}, (\gamma(\alpha)-1)/n\}; \alpha)$. The main result shows that this p-value is valid for testing $H_0: R>\alpha$ vs $H_1: R\le\alpha$, with monotonicity properties and a well-defined inverse mapping $g^{-1}$ to facilitate calibration. The work also discusses comparisons to Bentkus and Hoeffding-based valid p-values and highlights applicability to FWER-controlling procedures and distribution-free uncertainty quantification in predictive inference contexts.
Abstract
We build a valid p-value based on a concentration inequality for bounded random variables introduced by Pelekis, Ramon and Wang. The motivation behind this work is the calibration of predictive algorithms in a distribution-free setting. The super-uniform p-value is tighter than Hoeffding and Bentkus alternatives in certain regions. Even though we are motivated by a calibration setting in a machine learning context, the ideas presented in this work are also relevant in classical statistical inference. Furthermore, we compare the power of a collection of valid p- values for bounded losses, which are presented in previous literature.
