Sanity Checks for Explanation Uncertainty

Matias Valdenegro-Toro; Mihir Mulye

Sanity Checks for Explanation Uncertainty

Matias Valdenegro-Toro, Mihir Mulye

TL;DR

This paper proposes sanity checks for uncertainty explanation methods, where a weight and data randomization tests are defined for explanations with uncertainty, allowing for quick tests to combinations of uncertainty and explanation methods.

Abstract

Explanations for machine learning models can be hard to interpret or be wrong. Combining an explanation method with an uncertainty estimation method produces explanation uncertainty. Evaluating explanation uncertainty is difficult. In this paper we propose sanity checks for uncertainty explanation methods, where a weight and data randomization tests are defined for explanations with uncertainty, allowing for quick tests to combinations of uncertainty and explanation methods. We experimentally show the validity and effectiveness of these tests on the CIFAR10 and California Housing datasets, noting that Ensembles seem to consistently pass both tests with Guided Backpropagation, Integrated Gradients, and LIME explanations.

Sanity Checks for Explanation Uncertainty

TL;DR

Abstract

Paper Structure (11 sections, 12 equations, 7 figures, 3 tables)

This paper contains 11 sections, 12 equations, 7 figures, 3 tables.

Introduction
Sanity Checks for Uncertain Explanations
Explanation Uncertainty
Sanity Checks
Experiments
Saliency Methods
Uncertainty Quantification Methods
CIFAR10 Classification
California Housing Regression
Discussion
Conclusions and Future Work

Figures (7)

Figure 1: Example explanation with uncertainty on a FER dataset image using Guided Backpropagation and Ensembles.
Figure 2: SSIM values for the weight randomization test on CIFAR 10 using Dropout. These values are computed between the explanation representation generated with no weight randomization and with incremental weight randomization. As the amount of weight randomization increases, the similarity between explanation representations decreases.
Figure 3: Visualization of weight randomization effect on the explanations obtained using Dropout and Integrated Gradients on a CIFAR10 sample. Note how the both mean and standard deviation explanation become almost blank with increasing random weights.
Figure 4: Visualization of weight randomization effect on the explanations obtained using Dropout and Guided Backpropagation on a CIFAR10 sample. Note how the mean and standard deviation explanation become noisy with increasing random weights.
Figure 5: Visualization of data randomization effect on explanations obtained using Dropout and GBP and IG. Note how the explanation for the model with random labels is considerably more noisy than explanation from a model with true labels.
...and 2 more figures

Sanity Checks for Explanation Uncertainty

TL;DR

Abstract

Sanity Checks for Explanation Uncertainty

Authors

TL;DR

Abstract

Table of Contents

Figures (7)