Reproducibility review of "Why Not Other Classes": Towards Class-Contrastive Back-Propagation Explanations
Arvid Eriksson, Anton Israelsson, Mattias Kallhauge
TL;DR
This work provides a thorough reproducibility analysis of class-contrastive back-propagation explanations for neural image classifiers, extending the original approach to post-softmax backprop methods, XGradCAM, FullGrad, and Vision Transformers. By correcting mathematical gaps, validating across multiple backprop variants, and applying the method to ViTs with attention-rollout, the study demonstrates that contrastive explanations can better discriminate among competing classes and maintain generalizability. Key contributions include an open-source repository, clarified methodology, and empirical support for the claim that back-propagating from the softmax neuron $p_t$ yields more interpretable, targeted heatmaps than standard logits. The work highlights reproducibility challenges in explainability research and provides a practical, transferable framework for class-contrastive explanations with potential impact on debugging and model transparency in high-stakes settings.
Abstract
"Why Not Other Classes?": Towards Class-Contrastive Back-Propagation Explanations (Wang & Wang, 2022) provides a method for contrastively explaining why a certain class in a neural network image classifier is chosen above others. This method consists of using back-propagation-based explanation methods from after the softmax layer rather than before. Our work consists of reproducing the work in the original paper. We also provide extensions to the paper by evaluating the method on XGradCAM, FullGrad, and Vision Transformers to evaluate its generalization capabilities. The reproductions show similar results as the original paper, with the only difference being the visualization of heatmaps which could not be reproduced to look similar. The generalization seems to be generally good, with implementations working for Vision Transformers and alternative back-propagation methods. We also show that the original paper suffers from issues such as a lack of detail in the method and an erroneous equation which makes reproducibility difficult. To remedy this we provide an open-source repository containing all code used for this project.
