Uncertainty Quantification via Stable Distribution Propagation
Felix Petersen, Aashwin Mishra, Hilde Kuehne, Christian Borgelt, Oliver Deussen, Mikhail Yurochkin
TL;DR
This work tackles uncertainty quantification in neural networks by proposing Stable Distribution Propagation (SDP), a sampling-free method that propagates Gaussian and Cauchy input uncertainties through networks. SDP uses exact affine propagation and a total-variation-optimal local linearization for nonlinearities (notably ReLU) to deliver tractable, non-marginal distribution updates; it also extends to joint input/output uncertainty via Probabilistic Neural Networks. The authors demonstrate SDP's advantages over marginal moment matching and DVIA in both accuracy (TV/Wasserstein metrics) and computation, and show its utility for calibrated prediction intervals and out-of-distribution selective prediction, including applications on UCI datasets and MNIST/EMNIST with large networks. The method is compatible with pre-trained models and scalable to deep architectures, offering a practical tool for reliable uncertainty quantification in safety-critical settings.
Abstract
We propose a new approach for propagating stable probability distributions through neural networks. Our method is based on local linearization, which we show to be an optimal approximation in terms of total variation distance for the ReLU non-linearity. This allows propagating Gaussian and Cauchy input uncertainties through neural networks to quantify their output uncertainties. To demonstrate the utility of propagating distributions, we apply the proposed method to predicting calibrated confidence intervals and selective prediction on out-of-distribution data. The results demonstrate a broad applicability of propagating distributions and show the advantages of our method over other approaches such as moment matching.
