A View on Out-of-Distribution Identification from a Statistical Testing Theory Perspective
Alberto Caron, Chris Hicks, Vasilios Mavroudis
TL;DR
The paper reframes out-of-distribution detection as a non-parametric statistical testing problem and establishes identifiability conditions that determine when OOD can be reliably detected. It introduces a Wasserstein distance-based test statistic and proves both asymptotic uniform consistency under a separation condition and non-asymptotic power bounds, clarifying the limits of detectability. The analysis argues for the advantages of distributional-distance tests over KL/JS-based methods, especially in high-dimensional and non-overlapping regimes. Two experiments, a synthetic generative-model task and an MNIST versus Fashion-MNIST setup, demonstrate the practical effectiveness of the Wasserstein OOD test for detecting distributional shifts at test time.
Abstract
We study the problem of efficiently detecting Out-of-Distribution (OOD) samples at test time in supervised and unsupervised learning contexts. While ML models are typically trained under the assumption that training and test data stem from the same distribution, this is often not the case in realistic settings, thus reliably detecting distribution shifts is crucial at deployment. We re-formulate the OOD problem under the lenses of statistical testing and then discuss conditions that render the OOD problem identifiable in statistical terms. Building on this framework, we study convergence guarantees of an OOD test based on the Wasserstein distance, and provide a simple empirical evaluation.
