Calibrating LLMs with Information-Theoretic Evidential Deep Learning
Yawei Li, David Rügamer, Bernd Bischl, Mina Rezaei
TL;DR
Fine-tuned LLMs often exhibit overconfidence on small datasets, undermining trust in their uncertainty estimates. The authors introduce IB-EDL, which regularizes Evidential Deep Learning with an information bottleneck objective to suppress spurious evidence while preserving predictive information, achieving calibration gains with minimal overhead. By selecting the pre-evidence $Z=\tilde{\mathbf{e}}$ and using a Gaussian prior over $Z$, they derive a tractable IB objective that includes an $\ell_2$-regularization effect on the Dirichlet parameters, and show VB-based EDL methods as special cases within IB-EDL. Extensive experiments across multiple LLMs, datasets, and scenarios (ID calibration, OOD detection, and label-noise robustness) demonstrate substantial improvements in ECE and NLL without sacrificing accuracy, enabling more trustworthy deployment of LLMs in high-stakes domains.
Abstract
Fine-tuned large language models (LLMs) often exhibit overconfidence, particularly when trained on small datasets, resulting in poor calibration and inaccurate uncertainty estimates. Evidential Deep Learning (EDL), an uncertainty-aware approach, enables uncertainty estimation in a single forward pass, making it a promising method for calibrating fine-tuned LLMs. However, despite its computational efficiency, EDL is prone to overfitting, as its training objective can result in overly concentrated probability distributions. To mitigate this, we propose regularizing EDL by incorporating an information bottleneck (IB). Our approach IB-EDL suppresses spurious information in the evidence generated by the model and encourages truly predictive information to influence both the predictions and uncertainty estimates. Extensive experiments across various fine-tuned LLMs and tasks demonstrate that IB-EDL outperforms both existing EDL and non-EDL approaches. By improving the trustworthiness of LLMs, IB-EDL facilitates their broader adoption in domains requiring high levels of confidence calibration. Code is available at https://github.com/sandylaker/ib-edl.
