Trust Me, I Know the Way: Predictive Uncertainty in the Presence of Shortcut Learning
Lisa Wimmer, Bernd Bischl, Ludwig Bothmann
TL;DR
The paper investigates how predictive uncertainty should be represented when shortcut learning drives spurious correlations. It leverages a Bayesian ensemble framework and the entropy decomposition $H(Y)=H(Y| ext{Θ})+I(Y; ext{Θ})$ to separate aleatoric and epistemic components and to interpret EU as model disagreement. Through MNIST-based experiments with CMNIST3 and PMNIST3 shortcuts, the authors show that shortcuts can provoke disagreement among ensemble members, yielding high EU on OOD data while shortcut-free data yield more diffuse uncertainty. The work provides a step toward reconciling ignorance and disagreement perspectives on EU and highlights the significant impact of distribution shifts on uncertainty quantification in practical settings.
Abstract
The correct way to quantify predictive uncertainty in neural networks remains a topic of active discussion. In particular, it is unclear whether the state-of-the art entropy decomposition leads to a meaningful representation of model, or epistemic, uncertainty (EU) in the light of a debate that pits ignorance against disagreement perspectives. We aim to reconcile the conflicting viewpoints by arguing that both are valid but arise from different learning situations. Notably, we show that the presence of shortcuts is decisive for EU manifesting as disagreement.
