Use HiResCAM instead of Grad-CAM for faithful explanations of convolutional neural networks
Rachel Lea Draelos, Lawrence Carin
TL;DR
The paper identifies a fundamental fidelity gap in Grad-CAM due to gradient averaging and introduces HiResCAM, a class-specific explanation method that faithfully reflects the regions the model actually uses to make predictions. It proves HiResCAM generalizes CAM and provides theoretical connections to regression and Gradient × Input, with rigorous results for CNNs ending in a single fully connected layer and for CAM architectures. Empirically, HiResCAM yields explanations that align with model computations on PASCAL VOC 2012, while Grad-CAM often expands attention; human studies and medical imaging examples further illustrate the trade-off between fidelity and segmentation utility. The work argues for adopting HiResCAM in sensitive contexts where trustworthy explanations are essential, while noting Grad-CAM's practical value for weakly supervised segmentation tasks.
Abstract
Explanation methods facilitate the development of models that learn meaningful concepts and avoid exploiting spurious correlations. We illustrate a previously unrecognized limitation of the popular neural network explanation method Grad-CAM: as a side effect of the gradient averaging step, Grad-CAM sometimes highlights locations the model did not actually use. To solve this problem, we propose HiResCAM, a novel class-specific explanation method that is guaranteed to highlight only the locations the model used to make each prediction. We prove that HiResCAM is a generalization of CAM and explore the relationships between HiResCAM and other gradient-based explanation methods. Experiments on PASCAL VOC 2012, including crowd-sourced evaluations, illustrate that while HiResCAM's explanations faithfully reflect the model, Grad-CAM often expands the attention to create bigger and smoother visualizations. Overall, this work advances convolutional neural network explanation approaches and may aid in the development of trustworthy models for sensitive applications.
