A Geometric Explanation of the Likelihood OOD Detection Paradox
Hamidreza Kamkari, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini, Rahul G. Krishnan, Gabriel Loaiza-Ganem
TL;DR
This work identifies a geometric explanation for why likelihood-based DGMs assign high density to OOD data yet fail to generate such samples, attributing it to regions of high density that have low probability mass when the OOD data lie on low-dimensional manifolds. It introduces Local Intrinsic Dimension (LID) as a diagnostic of these regions and pairs LID estimates with log-likelihoods from pre-trained normalizing flows or diffusion models to form a dual-threshold OOD detector. The proposed method achieves or matches state-of-the-art OOD detection performance on the same backbones across a range of datasets, while providing a principled, unsupervised approach that generalizes beyond a single model family. This work advances both the theoretical understanding of likelihood pathologies and a practical, backbone-agnostic tool for robust OOD detection in real-world systems. It also highlights avenues for improving LID estimation in diffusion models and extending the approach to broader classes of DGMs in future research.
Abstract
Likelihood-based deep generative models (DGMs) commonly exhibit a puzzling behaviour: when trained on a relatively complex dataset, they assign higher likelihood values to out-of-distribution (OOD) data from simpler sources. Adding to the mystery, OOD samples are never generated by these DGMs despite having higher likelihoods. This two-pronged paradox has yet to be conclusively explained, making likelihood-based OOD detection unreliable. Our primary observation is that high-likelihood regions will not be generated if they contain minimal probability mass. We demonstrate how this seeming contradiction of large densities yet low probability mass can occur around data confined to low-dimensional manifolds. We also show that this scenario can be identified through local intrinsic dimension (LID) estimation, and propose a method for OOD detection which pairs the likelihoods and LID estimates obtained from a pre-trained DGM. Our method can be applied to normalizing flows and score-based diffusion models, and obtains results which match or surpass state-of-the-art OOD detection benchmarks using the same DGM backbones. Our code is available at https://github.com/layer6ai-labs/dgm_ood_detection.
