Identifying Drivers of Predictive Aleatoric Uncertainty
Pascal Iversen, Simon Witzke, Katharina Baum, Bernhard Y. Renard
TL;DR
This paper tackles explaining predictive aleatoric uncertainty by extending regression to output a Gaussian predictive distribution $y|\mathbf{x} \sim \mathcal{N}(\hat{\mu}_{\mathbf{x}}, \hat{\sigma}^2_{\mathbf{x}})$ and applying post-hoc variance explanations to elucidate drivers of irreducible noise. It introduces Variance Feature Attribution (VFA) flavors (e.g., VFA-SHAP, VFA-IG, VFA-LRP, VFA-DeepSHAP) and benchmarks them against CLUE and InfoSHAP across synthetic tabular data and an image dataset MNIST+U, using adapted XAI metrics for uncertainty explanations. The evaluation shows that VFA methods, particularly VFA-SHAP, generally outperform alternatives in identifying uncertainty drivers, with MNIST+U confirming variance-focused attributions align with ground-truth uncertainty masks. The work provides a scalable, minimally invasive approach to explain uncertainty in regression and offers a benchmark suite and evaluation protocol to advance uncertainty explanation research, with potential impact on risk-aware decision-making in real-world AI systems.
Abstract
Explainability and uncertainty quantification are key to trustable artificial intelligence. However, the reasoning behind uncertainty estimates is generally left unexplained. Identifying the drivers of uncertainty complements explanations of point predictions in recognizing model limitations and enhancing transparent decision-making. So far, explanations of uncertainties have been rarely studied. The few exceptions rely on Bayesian neural networks or technically intricate approaches, such as auxiliary generative models, thereby hindering their broad adoption. We propose a straightforward approach to explain predictive aleatoric uncertainties. We estimate uncertainty in regression as predictive variance by adapting a neural network with a Gaussian output distribution. Subsequently, we apply out-of-the-box explainers to the model's variance output. This approach can explain uncertainty influences more reliably than complex published approaches, which we demonstrate in a synthetic setting with a known data-generating process. We substantiate our findings with a nuanced, quantitative benchmark including synthetic and real, tabular and image datasets. For this, we adapt metrics from conventional XAI research to uncertainty explanations. Overall, the proposed method explains uncertainty estimates with little modifications to the model architecture and outperforms more intricate methods in most settings.
