Measuring Stochastic Data Complexity with Boltzmann Influence Functions
Nathan Ng, Roger Grosse, Marzyeh Ghassemi
TL;DR
This paper tackles uncertainty quantification under distribution shifts by reframing predictive uncertainty through the Minimum Description Length (MDL) lens, specifically using predictive normalized maximum likelihood (pNML). It introduces IF-COMP, a scalable approximation that employs Boltzmann influence functions (BIFs) to linearize models and estimate hindsight-optimal outputs and stochastic data complexity for both labeled and unlabeled data. The method achieves calibrated predictions and competitive complexity estimates while delivering substantial speedups over existing pNML-based approaches, and it demonstrates strong performance across uncertainty calibration, mislabel detection, and OOD detection benchmarks. Overall, the work highlights the practical viability of MDL-based uncertainty estimation in deep networks and provides a framework that blends theory with efficient, empirically validated algorithms for reliability under distribution shifts.
Abstract
Estimating the uncertainty of a model's prediction on a test point is a crucial part of ensuring reliability and calibration under distribution shifts. A minimum description length approach to this problem uses the predictive normalized maximum likelihood (pNML) distribution, which considers every possible label for a data point, and decreases confidence in a prediction if other labels are also consistent with the model and training data. In this work we propose IF-COMP, a scalable and efficient approximation of the pNML distribution that linearizes the model with a temperature-scaled Boltzmann influence function. IF-COMP can be used to produce well-calibrated predictions on test points as well as measure complexity in both labelled and unlabelled settings. We experimentally validate IF-COMP on uncertainty calibration, mislabel detection, and OOD detection tasks, where it consistently matches or beats strong baseline methods.
