Probabilities Are All You Need: A Probability-Only Approach to Uncertainty Estimation in Large Language Models
Manh Nguyen, Sunil Gupta, Hung Le
TL;DR
This paper tackles hallucinations in large language models by proposing PRO, a training-free uncertainty estimator that approximates the predictive entropy $H(Y|x)$ using the top-$K$ generation probabilities, with an adaptive threshold $\alpha$ that filters out low-probability outputs. The authors derive a lower bound on this entropy, $PRO(x) = -\log p^*_K - \sum_{i=1}^{K} p^*_i \log \frac{p^*_i}{p^*_K}$, where $p^*_1 \ge \dots \ge p^*_K$ are the retained top-$K$ probabilities, and report that PRO improves uncertainty estimation on TriviaQA, SciQ, and Natural Questions across multiple models, often outperforming more expensive baselines. The approach emphasizes simplicity and computational efficiency: it relies solely on token probabilities and avoids semantic embeddings and internal-state analyses. Empirical results show robust AUROC improvements for uncertainty estimation, supporting PRO as a practical tool for improving the trustworthiness of LLM outputs. Limitations include the dependence on token logits and the lack of semantic information; future work could integrate semantics and explore alternative decoding strategies.
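One way to see why this quantity lower-bounds the entropy is sketched below. It assumes that $p^*_1 \ge \dots \ge p^*_K$ are the $K$ largest response probabilities and that every response outside the top $K$ has probability at most $p^*_K$; the tail index $j$ is notation introduced here for illustration and may differ from the authors' own derivation.

$$
\begin{aligned}
H(Y|x) &= -\sum_{i=1}^{K} p^*_i \log p^*_i \;-\; \sum_{j>K} p_j \log p_j \\
&\ge -\sum_{i=1}^{K} p^*_i \log p^*_i \;-\; \Big(1 - \sum_{i=1}^{K} p^*_i\Big) \log p^*_K \\
&= -\log p^*_K \;-\; \sum_{i=1}^{K} p^*_i \log \frac{p^*_i}{p^*_K} \;=\; PRO(x).
\end{aligned}
$$

The inequality uses $-\log p_j \ge -\log p^*_K$ for every $j > K$ together with $\sum_{j>K} p_j = 1 - \sum_{i=1}^{K} p^*_i$. The bound is tight when every tail response with nonzero probability has probability exactly $p^*_K$.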
Abstract
Large Language Models (LLMs) exhibit strong performance across various natural language processing (NLP) tasks but remain vulnerable to hallucinations, generating factually incorrect or misleading outputs. Uncertainty estimation, often based on predictive entropy, is key to addressing this issue. However, existing methods typically require multiple samples or extra computation to assess semantic entropy. This paper proposes an efficient, training-free uncertainty estimation method that approximates predictive entropy using the top-$K$ probabilities of the responses. Moreover, we employ an adaptive mechanism that determines $K$, which adds flexibility and filters out low-confidence probabilities. Experimental results on three free-form question-answering datasets across several LLMs demonstrate that our method outperforms expensive state-of-the-art baselines, contributing to the broader goal of enhancing LLM trustworthiness.
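A minimal Python sketch of a probability-only score of this kind follows, using the closed form $-\log p^*_K - \sum_{i=1}^{K} p^*_i \log \frac{p^*_i}{p^*_K}$ over the retained probabilities. The function name `pro_score`, the plain list-of-floats input, and the treatment of `alpha` as a fixed absolute cutoff are assumptions made for illustration; this section does not specify the authors' exact adaptive rule, so the filtering step is only a stand-in for it.

```python
import math
from typing import Sequence


def pro_score(probs: Sequence[float], alpha: float = 0.0) -> float:
    """Probability-only lower-bound estimate of predictive entropy.

    probs: sequence-level probabilities of candidate responses
           (hypothetical input; in practice derived from token logits).
    alpha: hypothetical absolute cutoff; probabilities below it are
           dropped, which in turn determines K.
    """
    # Keep only probabilities that pass the cutoff, largest first.
    kept = sorted((p for p in probs if p >= alpha and p > 0.0), reverse=True)
    if not kept:
        raise ValueError("no probability passes the threshold")

    p_min = kept[-1]  # p*_K, the smallest retained probability
    # -log p*_K - sum_i p*_i * log(p*_i / p*_K)
    return -math.log(p_min) - sum(p * math.log(p / p_min) for p in kept)


if __name__ == "__main__":
    # A peaked distribution should score lower than a flat one.
    print(pro_score([0.9, 0.05, 0.03]))
    print(pro_score([0.3, 0.3, 0.3]))
    # With a cutoff of 0.1, only the first two probabilities are retained.
    print(pro_score([0.7, 0.2, 0.05], alpha=0.1))
```

A higher score indicates higher estimated uncertainty. Raising `alpha` drops more low-probability responses, which shrinks $K$ and generally loosens the bound.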
