Double Descent Meets Out-of-Distribution Detection: Theoretical Insights and Empirical Analysis on the role of model complexity
Mouïn Ben Ammar, David Brellmann, Arturo Mendoza, Antoine Manzanera, Gianni Franchi
TL;DR
The work shows that double descent, previously studied for ID generalization, also appears in post-hoc OOD detection across CNNs and transformers. It combines theory via random matrix theory in a Gaussian covariate model with extensive empirical validation on diverse architectures and OOD datasets, revealing that the OOD risk peaks near the interpolation threshold and that overparameterization is not universally beneficial. A Neural Collapse–based NC1 criterion is proposed to identify regimes where smaller models may outperform larger ones for OOD detection. The findings highlight the critical role of latent representation geometry in OOD detection and offer practical guidance for selecting model complexity in open-world settings.
Abstract
Out-of-distribution (OOD) detection is essential for ensuring the reliability and safety of machine learning systems. In recent years, it has received increasing attention, particularly through post-hoc detection and training-based methods. In this paper, we focus on post-hoc OOD detection, which enables identifying OOD samples without altering the model's training procedure or objective. Our primary goal is to investigate the relationship between model capacity and its OOD detection performance. Specifically, we aim to answer the following question: Does the Double Descent phenomenon manifest in post-hoc OOD detection? This question is crucial, as it can reveal whether overparameterization, which is already known to benefit generalization, can also enhance OOD detection. Despite the growing interest in these topics by the classic supervised machine learning community, this intersection remains unexplored for OOD detection. We empirically demonstrate that the Double Descent effect does indeed appear in post-hoc OOD detection. Furthermore, we provide theoretical insights to explain why this phenomenon emerges in such setting. Finally, we show that the overparameterized regime does not yield superior results consistently, and we propose a method to identify the optimal regime for OOD detection based on our observations.
