Fine-Grained Uncertainty Decomposition in Large Language Models: A Spectral Approach
Nassim Walha, Sebastian G. Gruber, Thomas Decker, Yinchong Yang, Alireza Javanmardi, Eyke Hüllermeier, Florian Buettner
TL;DR
The paper introduces Spectral Uncertainty, a theoretically grounded framework that decomposes total predictive uncertainty in LLMs into aleatoric and epistemic components using kernel-based von Neumann entropy within an RKHS. It establishes a general uncertainty decomposition via functional Bregman information, specializes it to the von Neumann entropy, and provides finite-sum spectral estimators to compute total, aleatoric, and epistemic uncertainty from continuous semantic representations. Through a two-stage sampling workflow that generates input clarifications and model answers, the method achieves state-of-the-art performance on ambiguity detection (aleatoric) and correctness prediction (total uncertainty) across multiple datasets and models, outperforming semantic- and clustering-based baselines. While computationally intensive due to n×m outputs, Spectral Uncertainty offers a principled and practical approach to uncertainty in LLMs with strong implications for reliability and interpretability in real-world AI systems.
Abstract
As Large Language Models (LLMs) are increasingly integrated in diverse applications, obtaining reliable measures of their predictive uncertainty has become critically important. A precise distinction between aleatoric uncertainty, arising from inherent ambiguities within input data, and epistemic uncertainty, originating exclusively from model limitations, is essential to effectively address each uncertainty source. In this paper, we introduce Spectral Uncertainty, a novel approach to quantifying and decomposing uncertainties in LLMs. Leveraging the Von Neumann entropy from quantum information theory, Spectral Uncertainty provides a rigorous theoretical foundation for separating total uncertainty into distinct aleatoric and epistemic components. Unlike existing baseline methods, our approach incorporates a fine-grained representation of semantic similarity, enabling nuanced differentiation among various semantic interpretations in model responses. Empirical evaluations demonstrate that Spectral Uncertainty outperforms state-of-the-art methods in estimating both aleatoric and total uncertainty across diverse models and benchmark datasets.
