Strengthening Interpretability: An Investigative Study of Integrated Gradient Methods
Shree Singhi, Anupriya Kumari
TL;DR
The paper conducts a thorough reproducibility study of Integrated Gradients (IG) and the Important Direction Gradient Integration (IDGI) framework, addressing theoretical claims and empirical performance. It provides a rigorous theoretical analysis, including a Taylor-series derivation of key quantities such as x_{j_p}, and demonstrates that IDGI can be more sensitive to the number of Riemann-sum steps than the underlying IG methods. Through extensive experiments on Imagenet with multiple models and metrics (Insertion Score, SIC, AIC, and MS-SSIM variants), the authors show that IDGI generally improves attribution quality for several baselines but not universally; certain architectures (e.g., residual networks) exhibit anomalies, and step-size significantly influences performance. The study also reveals that IDGI tends to enhance numerical stability and offers an implemented codebase to facilitate reproducibility, contributing valuable guidance for practitioners applying attribution methods in vision models.
Abstract
We conducted a reproducibility study on Integrated Gradients (IG) based methods and the Important Direction Gradient Integration (IDGI) framework. IDGI eliminates the explanation noise in each step of the computation of IG-based methods that use the Riemann Integration for integrated gradient computation. We perform a rigorous theoretical analysis of IDGI and raise a few critical questions that we later address through our study. We also experimentally verify the authors' claims concerning the performance of IDGI over IG-based methods. Additionally, we varied the number of steps used in the Riemann approximation, an essential parameter in all IG methods, and analyzed the corresponding change in results. We also studied the numerical instability of the attribution methods to check the consistency of the saliency maps produced. We developed the complete code to implement IDGI over the baseline IG methods and evaluated them using three metrics since the available code was insufficient for this study.
