Calibration Bands for Mean Estimates within the Exponential Dispersion Family
Łukasz Delong, Selim Gatti, Mario V. Wüthrich
TL;DR
This work develops non-asymptotic calibration bands for mean estimates within the exponential dispersion family (EDF), enabling tests of calibration and auto-calibration that operate for any sample size. By extending stochastic-order and convolution techniques from binary and sub-Gaussian settings to the whole EDF, the authors derive data-dependent bands built from weighted partial sums that hold with probability at least $1-\alpha$. They provide explicit closed-form bands for common discrete and continuous EDF members (e.g., binomial, Poisson, negative binomial, gamma, normal) and extend the construction to regression with a ranking function, along with methods to handle band crossings via isotonic recalibration. The approach is then demonstrated through numerical experiments and real-data applications, showing practical detection of calibration violations and robustness to dispersion parameter estimation, while highlighting computational strategies such as binning and selective pairing. The results offer a versatile non-asymptotic tool for assessing and testing calibration in EDF-based models with broad applicability in statistics and actuarial science.
Abstract
A statistical model is said to be calibrated if the resulting mean estimates perfectly match the true means of the underlying responses. Aiming for calibration is often not achievable in practice as one has to deal with finite samples of noisy observations. A weaker notion of calibration is auto-calibration. An auto-calibrated model satisfies that the expected value of the responses for a given mean estimate matches this estimate. Testing for autocalibration has only been considered recently in the literature and we propose a new approach based on calibration bands. Calibration bands denote a set of lower and upper bounds such that the probability that the true means lie simultaneously inside those bounds exceeds some given confidence level. Such bands were constructed by Yang-Barber (2019) for sub-Gaussian distributions. Dimitriadis et al. (2023) then introduced narrower bands for the Bernoulli distribution. We use the same idea in order to extend the construction to the entire exponential dispersion family that contains for example the binomial, Poisson, negative binomial, gamma and normal distributions. Moreover, we show that the obtained calibration bands allow us to construct various tests for calibration and auto-calibration, respectively. As the construction of the bands does not rely on asymptotic results, we emphasize that our tests can be used for any sample size.
