Epistemic Neural Networks
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy
TL;DR
The paper tackles the difficulty of reliable decision making under uncertainty by focusing on joint predictions rather than marginal predictions. It introduces epistemic neural networks (ENNs) as a broad function-space interface for expressing epistemic uncertainty, and presents the epinet, a lightweight architecture that augments any base neural network to yield high-quality joint predictions with substantially less computation than large ensembles. Theoretical results show ENNs subsume BNNs while enabling richer uncertainty representations, and empirical results on Neural Testbed and ImageNet demonstrate that epinet dramatically improves joint log-loss without sacrificing marginal performance, even when starting from pretrained models. This approach offers a practical pathway to calibrate and deploy uncertainty-aware models at scale, with public code to facilitate adoption and evaluation.
Abstract
Intelligence relies on an agent's knowledge of what it does not know. This capability can be assessed based on the quality of joint predictions of labels across multiple inputs. In principle, ensemble-based approaches produce effective joint predictions, but the computational costs of training large ensembles can become prohibitive. We introduce the epinet: an architecture that can supplement any conventional neural network, including large pretrained models, and can be trained with modest incremental computation to estimate uncertainty. With an epinet, conventional neural networks outperform very large ensembles, consisting of hundreds or more particles, with orders of magnitude less computation. The epinet does not fit the traditional framework of Bayesian neural networks. To accommodate development of approaches beyond BNNs, such as the epinet, we introduce the epistemic neural network (ENN) as an interface for models that produce joint predictions.
