Bayes Risk for Goodness of Fit Tests
Nicholas G. Polson, Vadim Sokolov, Daniel Zantedeschi
TL;DR
This work reframes GOF testing under a Bayesian decision-theoretic lens and demonstrates that Bayes-risk optimization drives GOF calibrations to a moderate-deviation scale, yielding thresholds that inflate like $t_n^*\sim\sqrt{\frac{\kappa}{4\rho}\log n}$ and Type I errors that decay polynomially. It unifies KS-type empirical-process GOF, Sanov-based tests, and parametric GOF through a two-term risk decomposition governed by the tail parameter $\rho$ and the local prior mass exponent $\kappa$, connecting to Sanov information asymptotics and Fisher information geometry. The framework is instantiated across multiple GOF settings—location testing under Laplace models, shape testing via Bayes factors, multinomial GOF, and contingency-table independence tests—with numerical verification confirming the MDp calibration. The results offer principled, sample-size-dependent calibration guidelines and illuminate practical implications for statistical practice in scientific and regulatory contexts.
Abstract
We develop a unified framework for goodness-of-fit (GOF) testing through the lens of Bayes risk. Classical GOF procedures are commonly calibrated either at fixed significance level (CLT scale) or through exponential error exponents (LDP scale). We establish that Bayes-risk optimal calibration operates on the moderate-deviation (MDP) scale, producing canonical $\sqrt{\log n}$ inflation of rejection thresholds and polynomially decaying Type I error. Our main contributions are: (i) we formalise the Rubin--Sethuraman program for KS-type statistics as a risk-calibration theorem with explicit regularity conditions on priors and empirical-process functionals; (ii) we develop the precise connection between Bayes-risk expansions and Sanov information asymptotics, showing how $\log n$-order truncations arise naturally when risk, rather than pure exponents, is the evaluation criterion; (iii) we provide detailed applications to location testing under Laplace families, shape testing via Bayes factors, and connections to Fisher information geometry. The organizing principle throughout is that sample size enters Bayes-optimal GOF cutoffs through the MDP scale, unifying KS-based and Sanov-based perspectives under a single risk criterion.
