Uncertainty Unveiled: Can Exposure to More In-context Examples Mitigate Uncertainty for Large Language Models?

Yifei Wang; Yu Sheng; Linjing Li; Daniel Zeng

TL;DR

This paper analyzes how increasing the number of in-context demonstrations affects the trustworthiness of large language models (LLMs) under long-context in-context learning (ICL). It introduces a Bayesian uncertainty quantification framework that decomposes total uncertainty ($TU$) into epistemic ($EU$) and aleatoric ($AU$) components, and shows that additional in-context examples mainly reduce $EU$, improving performance by injecting task-specific knowledge. The benefits persist at large model scales, but for complex reasoning tasks they can be tempered by rising $AU$. The study also examines internal mechanisms, via residual-stream projections and logit-margin amplification, that explain the uncertainty reductions. Practically, the work suggests favoring diverse, information-rich demonstrations and offers interpretability directions for understanding how internal confidence evolves during long-context ICL, with implications for deploying trustworthy prompting strategies in high-stakes settings.
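To make the $TU$ / $EU$ / $AU$ split concrete, here is a minimal sketch in Python. It assumes the commonly used entropy-based decomposition, in which $TU$ is the entropy of the averaged predictive distribution, $AU$ is the average entropy of the individual predictions, and $EU = TU - AU$. The paper's exact estimator is not stated in this summary, so treat this as an illustration rather than the authors' implementation. The function names `entropy` and `decompose_uncertainty` are hypothetical, as is the idea that each row of `probs` comes from a different sampled demonstration set.

```python
import numpy as np


def entropy(p, axis=-1, eps=1e-12):
    """Shannon entropy (in nats) along `axis`; clipping avoids log(0)."""
    p = np.clip(p, eps, 1.0)
    return -np.sum(p * np.log(p), axis=axis)


def decompose_uncertainty(probs):
    """Split total uncertainty into aleatoric and epistemic parts.

    probs: array of shape (n_samples, n_classes). Each row is one
    predictive distribution over answer options, for example obtained
    with a different set of in-context demonstrations (an assumption
    made for this sketch).
    Returns (TU, AU, EU) under the entropy-based decomposition.
    """
    probs = np.asarray(probs, dtype=float)
    mean_p = probs.mean(axis=0)      # averaged predictive distribution
    tu = entropy(mean_p)             # TU: entropy of the average
    au = entropy(probs).mean()       # AU: average of per-sample entropies
    eu = tu - au                     # EU: disagreement between samples
    return tu, au, eu


# Samples that disagree completely: uncertainty is almost entirely epistemic.
tu_d, au_d, eu_d = decompose_uncertainty([[1.0, 0.0], [0.0, 1.0]])

# Samples that agree on a uniform answer: uncertainty is entirely aleatoric.
tu_a, au_a, eu_a = decompose_uncertainty([[0.5, 0.5], [0.5, 0.5]])
```

Under this decomposition, a drop in $EU$ as demonstrations are added corresponds to predictions becoming more consistent across demonstration sets, while $AU$ reflects the uncertainty that remains within each individual prediction.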

Abstract

Recent advances in handling long sequences have facilitated the exploration of long-context in-context learning (ICL). While much of the existing research emphasizes performance improvements driven by additional in-context examples, the influence on the trustworthiness of generated responses remains underexplored. This paper addresses this gap by investigating how increased examples influence predictive uncertainty, an essential aspect of trustworthiness. We begin by systematically quantifying the uncertainty of ICL with varying shot counts, analyzing the impact of example quantity. Through uncertainty decomposition, we introduce a novel perspective on performance enhancement, with a focus on epistemic uncertainty (EU). Our results reveal that additional examples reduce total uncertainty in both simple and complex tasks by injecting task-specific knowledge, thereby diminishing EU and enhancing performance. For complex tasks, these advantages emerge only after addressing the increased noise and uncertainty associated with longer inputs. Finally, we explore the evolution of internal confidence across layers, unveiling the mechanisms driving the reduction in uncertainty.
