Table of Contents
Fetching ...

Using Stratified Sampling to Improve LIME Image Explanations

Muhammad Rashid, Elvio G. Amparore, Enrico Ferrari, Damiano Verda

TL;DR

This work derives all the formulas and adjustment factors required for an unbiased stratified sampling estimator for LIME Image, a popular model-agnostic explainable AI method for computer vision tasks, and highlights a connection with the Shapley theory.

Abstract

We investigate the use of a stratified sampling approach for LIME Image, a popular model-agnostic explainable AI method for computer vision tasks, in order to reduce the artifacts generated by typical Monte Carlo sampling. Such artifacts are due to the undersampling of the dependent variable in the synthetic neighborhood around the image being explained, which may result in inadequate explanations due to the impossibility of fitting a linear regressor on the sampled data. We then highlight a connection with the Shapley theory, where similar arguments about undersampling and sample relevance were suggested in the past. We derive all the formulas and adjustment factors required for an unbiased stratified sampling estimator. Experiments show the efficacy of the proposed approach.

Using Stratified Sampling to Improve LIME Image Explanations

TL;DR

This work derives all the formulas and adjustment factors required for an unbiased stratified sampling estimator for LIME Image, a popular model-agnostic explainable AI method for computer vision tasks, and highlights a connection with the Shapley theory.

Abstract

We investigate the use of a stratified sampling approach for LIME Image, a popular model-agnostic explainable AI method for computer vision tasks, in order to reduce the artifacts generated by typical Monte Carlo sampling. Such artifacts are due to the undersampling of the dependent variable in the synthetic neighborhood around the image being explained, which may result in inadequate explanations due to the impossibility of fitting a linear regressor on the sampled data. We then highlight a connection with the Shapley theory, where similar arguments about undersampling and sample relevance were suggested in the past. We derive all the formulas and adjustment factors required for an unbiased stratified sampling estimator. Experiments show the efficacy of the proposed approach.
Paper Structure (16 sections, 18 equations, 7 figures, 1 algorithm)

This paper contains 16 sections, 18 equations, 7 figures, 1 algorithm.

Figures (7)

  • Figure 1: LIME Image workflow.
  • Figure 2: How LIME is supposed to work (A), and how it actually works (B) using Monte Carlo sampling for a large enough $k$.
  • Figure 3: Binomial (A) and Shapley weight (B) distributions for $k = 10, 20$ and $50$.
  • Figure 4: Dependent variable undersampling (low $RC(Y)$) results in confused explanations (low $CV(\beta)$).
  • Figure 5: Four explanations $\widehat{\beta}$ of the same image of Fig. \ref{['fig:hyenaMC']} using stratified sampling (each is an average of $10$ runs).
  • ...and 2 more figures