Modeling Clinical Uncertainty in Radiology Reports: from Explicit Uncertainty Markers to Implicit Reasoning Pathways
Paloma Rabaey, Jong Hak Moon, Jung-Oh Lee, Min Gwan Kim, Hangyul Yoon, Thomas Demeester, Edward Choi
TL;DR
This work addresses the challenge of uncertainty in radiology reports by separating explicit hedging-based uncertainty from implicit diagnostic reasoning. It introduces a two-part framework: (i) explicit uncertainty is quantified by constructing an expert-validated, LLM-driven reference ranking of hedging phrases and mapping sentence-level hedging to a continuous probability for each finding; (ii) implicit uncertainty is modeled through a Pathway Expansion Framework that expands reports along expert-defined diagnostic pathways for 14 chest X-ray diagnoses, reconstructing omitted sub-findings. The authors release Lunguage++, an uncertainty-aware extension of the Lunguage dataset, which enables uncertainty-aware image classification, faithful diagnostic reasoning, and investigations into how diagnostic uncertainty impacts clinical outcomes. Methodologically, the paper combines hedging phrase extraction, TrueSkill-based ranking, LLM judgments, expert validation, and a rule-based expansion grounded in DAG-structured diagnostic pathways, producing a richer, more interpretable resource for uncertainty-aware radiology AI and evaluation. The work offers practical implications for training and evaluating uncertainty-aware models and provides reusable resources and code for community adoption via Github and Physionet.
Abstract
Radiology reports are invaluable for clinical decision-making and hold great potential for automated analysis when structured into machine-readable formats. These reports often contain uncertainty, which we categorize into two distinct types: (i) Explicit uncertainty reflects doubt about the presence or absence of findings, conveyed through hedging phrases. These vary in meaning depending on the context, making rule-based systems insufficient to quantify the level of uncertainty for specific findings; (ii) Implicit uncertainty arises when radiologists omit parts of their reasoning, recording only key findings or diagnoses. Here, it is often unclear whether omitted findings are truly absent or simply unmentioned for brevity. We address these challenges with a two-part framework. We quantify explicit uncertainty by creating an expert-validated, LLM-based reference ranking of common hedging phrases, and mapping each finding to a probability value based on this reference. In addition, we model implicit uncertainty through an expansion framework that systematically adds characteristic sub-findings derived from expert-defined diagnostic pathways for 14 common diagnoses. Using these methods, we release Lunguage++, an expanded, uncertainty-aware version of the Lunguage benchmark of fine-grained structured radiology reports. This enriched resource enables uncertainty-aware image classification, faithful diagnostic reasoning, and new investigations into the clinical impact of diagnostic uncertainty.
