Epistemic Errors of Imperfect Multitask Learners When Distributions Shift
Sabina J. Sloman, Michele Caprio, Samuel Kaski
TL;DR
This paper tackles how uncertainty-aware learners incur epistemic errors when faced with distribution shift in a multitask setting. It introduces a principled, decomposable epistemic error bound that separates model restrictions, data scarcity, and distribution shift as distinct sources of reducible error, formalized via total variation distance. The main result extends to special cases such as Bayesian transfer learning and total variation neighborhoods, and the authors discuss negative transfer and practical computability of the bound terms. The framework provides a diagnostic tool for when and how to reduce epistemic error by choosing appropriate inductive biases, data acquisition, or robust training strategies, with implications for uncertainty quantification in high-stakes domains. Limitations and future work point to extending the theory to conformal prediction and validating the bounds in more complex, real-world settings across time and higher dimensions.
Abstract
Uncertainty-aware machine learners, such as Bayesian neural networks, output a quantification of uncertainty instead of a point prediction. In this work, we provide uncertainty-aware learners with a principled framework to characterize, and identify ways to eliminate, errors that arise from reducible (epistemic) uncertainty. We introduce a principled definition of epistemic error, and provide a decompositional epistemic error bound which operates in the very general setting of imperfect multitask learning under distribution shift. In this setting, the training (source) data may arise from multiple tasks, the test (target) data may differ systematically from the source data tasks, and/or the learner may not arrive at an accurate characterization of the source data. Our bound separately attributes epistemic errors to each of multiple aspects of the learning procedure and environment. As corollaries of the general result, we provide epistemic error bounds specialized to the settings of Bayesian transfer learning and distribution shift within $ε$-neighborhoods. We additionally leverage the terms in our bound to provide a novel definition of negative transfer.
