Using Stratified Sampling to Improve LIME Image Explanations

Muhammad Rashid; Elvio G. Amparore; Enrico Ferrari; Damiano Verda

Using Stratified Sampling to Improve LIME Image Explanations

Muhammad Rashid, Elvio G. Amparore, Enrico Ferrari, Damiano Verda

TL;DR

This work derives all the formulas and adjustment factors required for an unbiased stratified sampling estimator for LIME Image, a popular model-agnostic explainable AI method for computer vision tasks, and highlights a connection with the Shapley theory.

Abstract

We investigate the use of a stratified sampling approach for LIME Image, a popular model-agnostic explainable AI method for computer vision tasks, in order to reduce the artifacts generated by typical Monte Carlo sampling. Such artifacts are due to the undersampling of the dependent variable in the synthetic neighborhood around the image being explained, which may result in inadequate explanations due to the impossibility of fitting a linear regressor on the sampled data. We then highlight a connection with the Shapley theory, where similar arguments about undersampling and sample relevance were suggested in the past. We derive all the formulas and adjustment factors required for an unbiased stratified sampling estimator. Experiments show the efficacy of the proposed approach.

Using Stratified Sampling to Improve LIME Image Explanations

TL;DR

Abstract

Paper Structure (16 sections, 18 equations, 7 figures, 1 algorithm)

This paper contains 16 sections, 18 equations, 7 figures, 1 algorithm.

Introduction
Paper Contributions.
Previous Work
Preliminaries
Interpretation of LIME.
Limitations of LIME Image Sampling
Under-Representation of the Neighborhood.
Dependent Variables Distribution.
Sample Relevance.
Proposed Methodology
The Mixture Model.
Impact of Stratified Sampling in LIME Image.
Experimental Evaluation
Conclusions
Availability
...and 1 more sections

Figures (7)

Figure 1: LIME Image workflow.
Figure 2: How LIME is supposed to work (A), and how it actually works (B) using Monte Carlo sampling for a large enough $k$.
Figure 3: Binomial (A) and Shapley weight (B) distributions for $k = 10, 20$ and $50$.
Figure 4: Dependent variable undersampling (low $RC(Y)$) results in confused explanations (low $CV(\beta)$).
Figure 5: Four explanations $\widehat{\beta}$ of the same image of Fig. \ref{['fig:hyenaMC']} using stratified sampling (each is an average of $10$ runs).
...and 2 more figures

Using Stratified Sampling to Improve LIME Image Explanations

TL;DR

Abstract

Using Stratified Sampling to Improve LIME Image Explanations

Authors

TL;DR

Abstract

Table of Contents

Figures (7)