Model-agnostic variable importance for predictive uncertainty: an entropy-based approach
Danny Wood, Theodore Papamarkou, Matt Benatan, Richard Allmendinger
TL;DR
This paper addresses the need to explain not only model predictions but also the uncertainty of those predictions in probabilistic, uncertainty-aware models. It introduces model-agnostic adaptations of permutation feature importance, partial dependence plots, and individual conditional expectations to quantify how features affect the predictive distribution’s likelihood and entropy, via Likelihood-PFI/Entropy-PFI and their PDP/ICE counterparts. The authors establish theoretical properties, discuss interpretation, and demonstrate how these measures reveal when features share information, when uncertainty arises from extrapolation, and how this impacts model performance, using synthetic and real-world datasets in classification and regression. The work offers practical, interpretable diagnostics for uncertainty sources that complement existing explainability approaches, with open-source code to facilitate adoption. Overall, the approach advances trustworthy AI by enabling nuanced, distribution-focused explanations that are agnostic to the underlying model.
Abstract
In order to trust the predictions of a machine learning algorithm, it is necessary to understand the factors that contribute to those predictions. In the case of probabilistic and uncertainty-aware models, it is necessary to understand not only the reasons for the predictions themselves, but also the reasons for the model's level of confidence in those predictions. In this paper, we show how existing methods in explainability can be extended to uncertainty-aware models and how such extensions can be used to understand the sources of uncertainty in a model's predictive distribution. In particular, by adapting permutation feature importance, partial dependence plots, and individual conditional expectation plots, we demonstrate that novel insights into model behaviour may be obtained and that these methods can be used to measure the impact of features on both the entropy of the predictive distribution and the log-likelihood of the ground truth labels under that distribution. With experiments using both synthetic and real-world data, we demonstrate the utility of these approaches to understand both the sources of uncertainty and their impact on model performance.
