Calibrating Expressions of Certainty
Peiqi Wang, Barbara D. Lam, Yingcheng Liu, Ameneh Asgari-Targhi, Rameswar Panda, William M. Wells, Tina Kapur, Polina Golland
TL;DR
This paper reframes certainty in predictions as distributions over the probability simplex, enabling a generalized calibration framework beyond scalar confidence scores. It extends the expected calibration error to distributional outputs, derives robust estimators, and use discrete optimal transport to construct interpretable calibration maps between certainty phrases. The authors validate the approach on radiologists and language models, showing that OT-based post-hoc calibration improves ECE and Brier scores while yielding actionable guidance (e.g., substitute phrases) for humans. The work provides a practical, distribution-level method for improving the reliability of natural language expressions of certainty in both medical and AI systems, with broad implications for decision-making and trust.
Abstract
We present a novel approach to calibrating linguistic expressions of certainty, e.g., "Maybe" and "Likely". Unlike prior work that assigns a single score to each certainty phrase, we model uncertainty as distributions over the simplex to capture their semantics more accurately. To accommodate this new representation of certainty, we generalize existing measures of miscalibration and introduce a novel post-hoc calibration method. Leveraging these tools, we analyze the calibration of both humans (e.g., radiologists) and computational models (e.g., language models) and provide interpretable suggestions to improve their calibration.
