Quantifying the Prediction Uncertainty of Machine Learning Models for Individual Data
Koby Bibas
TL;DR
The thesis develops an information-theoretic framework for quantifying prediction uncertainty of machine learning models on individual data using predictive normalized maximum likelihood (pNML). It derives explicit pNML solutions for ordinary and over-parameterized linear regression, introduces a luckiness-enhanced variant (LpNML), and analyzes learnable subspaces where pNML performs well, even without distributional assumptions. Extending to neural networks, it provides analytical pNML formulations for single-layer and last-layer adaptations, enabling confidence estimates and improved OOD detection, while also proposing Adversarial pNML as a defense and a Deep Individual Active Learning (DIAL) strategy. Across experiments on UCI, MNIST, CIFAR, and ImageNet, the approach yields accurate uncertainty quantification, better OOD detection, targeted adversarial defense gains, and sample-efficient active learning, often outperforming strong baselines. Overall, the work offers distribution-free, point-wise uncertainty measures with practical benefits for robustness, open-set recognition, and active data acquisition in modern ML systems.
Abstract
Machine learning models have exhibited exceptional results in various domains. The most prevalent approach for learning is the empirical risk minimizer (ERM), which adapts the model's weights to reduce the loss on a training set and subsequently leverages these weights to predict the label for new test data. Nonetheless, ERM makes the assumption that the test distribution is similar to the training distribution, which may not always hold in real-world situations. In contrast, the predictive normalized maximum likelihood (pNML) was proposed as a min-max solution for the individual setting where no assumptions are made on the distribution of the tested input. This study investigates pNML's learnability for linear regression and neural networks, and demonstrates that pNML can improve the performance and robustness of these models on various tasks. Moreover, the pNML provides an accurate confidence measure for its output, showcasing state-of-the-art results for out-of-distribution detection, resistance to adversarial attacks, and active learning.
