UPLME: Uncertainty-Aware Probabilistic Language Modelling for Robust Empathy Regression
Md Rakibul Hasan, Md Zakir Hossain, Aneesh Krishna, Shafin Rahman, Tom Gedeon
TL;DR
This work tackles the problem of noisy self‑reported empathy labels in regression tasks by introducing UPLME, an uncertainty‑aware probabilistic language modeling framework that predicts both empathy scores and input‑dependent uncertainty. The method uses a cross‑encoder backbone with two parallel regression heads and employs variational model ensembling (Monte Carlo dropout) to capture epistemic uncertainty, along with three losses: a beta‑NLL objective, a variance penalty, and an alignment loss that ties representations of paired texts to their empathic similarity. Empirical results on NewsEmp21, NewsEmp24, and EmpStories show state‑of‑the‑art regression performance and superior uncertainty calibration compared with recent baselines, including UCVME, without requiring external data cleaning or dual‑model consistency constraints. The approach demonstrates that explicit uncertainty modeling can effectively downweight noisy labels, improve calibration, and reveal a meaningful latent structure related to empathy, with practical implications for robust NLP in social‑psychology tasks.
Abstract
Noisy self-reported empathy scores challenge supervised learning for empathy regression. While many algorithms have been proposed for learning with noisy labels in textual classification problems, the regression counterpart is relatively under-explored. We propose UPLME, an uncertainty-aware probabilistic language modelling framework to capture label noise in empathy regression tasks. One of the novelties in UPLME is a probabilistic language model that predicts both empathy scores and heteroscedastic uncertainty, and is trained using Bayesian concepts with variational model ensembling. We further introduce two novel loss components: one penalises degenerate Uncertainty Quantification (UQ), and another enforces similarity between the input pairs on which empathy is being predicted. UPLME achieves state-of-the-art performance (Pearson Correlation Coefficient: $0.558\rightarrow0.580$ and $0.629\rightarrow0.634$) in terms of the performance reported in the literature on two public benchmarks with label noise. Through synthetic label noise injection, we demonstrate that UPLME is effective in distinguishing between noisy and clean samples based on the predicted uncertainty. UPLME further outperform (Calibration error: $0.571\rightarrow0.376$) a recent variational model ensembling-based UQ method designed for regression problems. Code is publicly available at https://github.com/hasan-rakibul/UPLME.
