Multiple Different Black Box Explanations for Image Classifiers
Hana Chockler, David A. Kelly, Daniel Kroening
TL;DR
The paper tackles the limitation of single explanations for image classifier decisions by introducing MultEX, a black-box method grounded in actual causality to output multiple, diverse explanations for a given image. It models classifiers as probabilistic causal networks with pixel-level endogenous variables, defines explanations as minimal pixel subsets that guarantee the top-class decision, and proves the problem’s intractability while offering a practical, plug-in algorithm (CAUSAL_RANK, searchlight-based exploration, minimize, and separate) to approximate multiple explanations. Across ImageNet-1k, VOC2012, and ECSSD with ResNet50, ConvNext, and ViT-B-32, MultEX produces more explanations that are smaller and consistently tied to the top classification, and it remains robust to probability-threshold settings unlike Sag. The approach yields disjoint, localized explanations even on occluded images, enhancing interpretability and debugging capabilities for modern CNNs and transformer-based vision models, with code and data openly available.
Abstract
Existing explanation tools for image classifiers usually give only a single explanation for an image's classification. For many images, however, image classifiers accept more than one explanation for the image label. These explanations are useful for analyzing the decision process of the classifier and for detecting errors. Thus, restricting the number of explanations to just one severely limits insight into the behavior of the classifier. In this paper, we describe an algorithm and a tool, MultEX, for computing multiple explanations as the output of a black-box image classifier for a given image. Our algorithm uses a principled approach based on actual causality. We analyze its theoretical complexity and evaluate MultEX against the state-of-the-art across three different models and three different datasets. We find that MultEX finds more explanations and that these explanations are of higher quality.
