Probabilistic Skip Connections for Deterministic Uncertainty Quantification in Deep Neural Networks
Felix Jimenez, Matthias Katzfuss
TL;DR
This work introduces Probabilistic Skip Connections (PSCs) to enable deterministic uncertainty quantification in classification networks without retraining with spectral normalization. By locating an intermediate layer that simultaneously preserves feature sensitivity and smoothness via neural-collapse metrics, PSCs project and attach a probabilistic head to yield reliable iD and OOD assessments in a single forward pass. The method leverages a Tucker-based projection of intermediate representations and a linear probabilistic head (Laplace/KFAC) to provide accurate uncertainty estimates, matching or surpassing SN-trained baselines across architectures and datasets, including networks without residual connections. Empirical results also show the intermediate representations exhibit exploitable low-rank structure, allowing effective dimensionality reduction without eroding UQ quality. Overall, PSCs offer a drop-in, scalable path to high-quality UQ and OOD detection without retraining, expanding deterministic UQ to a broader class of networks.
Abstract
Deterministic uncertainty quantification (UQ) in deep learning aims to estimate uncertainty with a single pass through a network by leveraging outputs from the network's feature extractor. Existing methods require that the feature extractor be both sensitive and smooth, ensuring meaningful input changes produce meaningful changes in feature vectors. Smoothness enables generalization, while sensitivity prevents feature collapse, where distinct inputs are mapped to identical feature vectors. To meet these requirements, current deterministic methods often retrain networks with spectral normalization. Instead of modifying training, we propose using measures of neural collapse to identify an existing intermediate layer that is both sensitive and smooth. We then fit a probabilistic model to the feature vector of this intermediate layer, which we call a probabilistic skip connection (PSC). Through empirical analysis, we explore the impact of spectral normalization on neural collapse and demonstrate that PSCs can effectively disentangle aleatoric and epistemic uncertainty. Additionally, we show that PSCs achieve uncertainty quantification and out-of-distribution (OOD) detection performance that matches or exceeds existing single-pass methods requiring training modifications. By retrofitting existing models, PSCs enable high-quality UQ and OOD capabilities without retraining.
