REPEAT: Improving Uncertainty Estimation in Representation Learning Explainability
Kristoffer K. Wickstrøm, Thea Brüsch, Michael C. Kampffmeyer, Robert Jenssen
TL;DR
The paper tackles the lack of a true certainty measure for pixel importance in unsupervised representation learning (R-XAI). REPEAT models each pixel as a Bernoulli random variable with probability $p_{ij}$ of being important and derives uncertainty as $U_{ij}=p_{ij}(1-p_{ij})$ by aggregating $K$ thresholded samples from a base stochastic R-XAI method. Across multiple datasets and encoders, REPEAT achieves more intuitive certainty estimates, stronger out-of-distribution detection, and lower uncertainty complexity than state-of-the-art baselines, including RELAX and Kernel SHAP. The method is flexible and can deploy with different stochastic R-XAI bases, enabling safer, more transparent representations in unsupervised settings.
Abstract
Incorporating uncertainty is crucial to provide trustworthy explanations of deep learning models. Recent works have demonstrated how uncertainty modeling can be particularly important in the unsupervised field of representation learning explainable artificial intelligence (R-XAI). Current R-XAI methods provide uncertainty by measuring variability in the importance score. However, they fail to provide meaningful estimates of whether a pixel is certainly important or not. In this work, we propose a new R-XAI method called REPEAT that addresses the key question of whether or not a pixel is \textit{certainly} important. REPEAT leverages the stochasticity of current R-XAI methods to produce multiple estimates of importance, thus considering each pixel in an image as a Bernoulli random variable that is either important or unimportant. From these Bernoulli random variables we can directly estimate the importance of a pixel and its associated certainty, thus enabling users to determine certainty in pixel importance. Our extensive evaluation shows that REPEAT gives certainty estimates that are more intuitive, better at detecting out-of-distribution data, and more concise.
