Enabling Uncertainty Estimation in Iterative Neural Networks
Nikita Durasov, Doruk Oner, Jonathan Donier, Hieu Le, Pascal Fua
TL;DR
The paper tackles uncertainty quantification for iterative neural networks by exploiting convergence behavior of successive refinements. It defines per-output uncertainty as $U^i = \mathrm{Var}(\{\mathbf{y}_1^i, \ldots, \mathbf{y}_N^i\})$ and trains using $\mathcal{L}_{total} = \sum_{i=1}^{N} \mathcal{D}(\mathbf{y}_{i}, \mathbf{y}^{\rm gt})$ while keeping the architecture fixed. The approach delivers competitive calibration and uncertainty quality, often on par with Deep Ensembles but at a fraction of the computational cost, demonstrated in road delineation and aerodynamic shape optimization using Bayesian optimization. Overall, convergence speed serves as a robust proxy for predictive certainty, enabling fast, practical uncertainty estimates across diverse tasks without modifying model design.
Abstract
Turning pass-through network architectures into iterative ones, which use their own output as input, is a well-known approach for boosting performance. In this paper, we argue that such architectures offer an additional benefit: The convergence rate of their successive outputs is highly correlated with the accuracy of the value to which they converge. Thus, we can use the convergence rate as a useful proxy for uncertainty. This results in an approach to uncertainty estimation that provides state-of-the-art estimates at a much lower computational cost than techniques like Ensembles, and without requiring any modifications to the original iterative model. We demonstrate its practical value by embedding it in two application domains: road detection in aerial images and the estimation of aerodynamic properties of 2D and 3D shapes.
