Mapping Clinical Doubt: Locating Linguistic Uncertainty in LLMs
Srivarshinee Sridhar, Raghav Kaushik Ravi, Kripabandhu Ghosh
TL;DR
The paper investigates how linguistic uncertainty, encoded through epistemic modality, is internally represented in LLMs. It introduces Model Sensitivity to Uncertainty (MSU), a layerwise activation-difference metric, and a 3,114-pair dataset to probe how uncertainty cues shift internal representations. Results show MSU increases with depth across multiple models, with later layers encoding more epistemic information and PCA analyses revealing clustering and geometric inversions in deep layers. These findings suggest a distributed, late-emergent encoding of epistemic cues, with important implications for interpretability and reliability in clinical contexts.
Abstract
Large Language Models (LLMs) are increasingly used in clinical settings, where sensitivity to linguistic uncertainty can influence diagnostic interpretation and decision-making. Yet little is known about where such epistemic cues are internally represented within these models. Distinct from uncertainty quantification, which measures output confidence, this work examines input-side representational sensitivity to linguistic uncertainty in medical text. We curate a contrastive dataset of clinical statements varying in epistemic modality (e.g., 'is consistent with' vs. 'may be consistent with') and propose Model Sensitivity to Uncertainty (MSU), a layerwise probing metric that quantifies activation-level shifts induced by uncertainty cues. Our results show that LLMs exhibit structured, depth-dependent sensitivity to clinical uncertainty, suggesting that epistemic information is progressively encoded in deeper layers. These findings reveal how linguistic uncertainty is internally represented in LLMs, offering insight into their interpretability and epistemic reliability.
