Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation
Tillmann Rheude, Andreas Wirtz, Arjan Kuijper, Stefan Wesarg
TL;DR
The paper tackles the explainability of CNN-based semantic segmentation, with emphasis on medical imaging where locating decision-relevant regions is crucial. It introduces Seg-HiRes-Grad CAM, a transfer of the classification-based HiRes CAM to segmentation by computing heatmaps from the deepest feature maps using a pixel-set $\mathcal{M}$, i.e., $L^c_{Seg-HiRes-Grad CAM} = sum_k \alpha_c^k A^k$, where $\alpha_c^k = \frac{y^{c, new}}{∂A^k}$ and $y^{c, new} = sum_{i,j \in \mathcal{M}} y^c_{i,j}$. This approach preserves gradient-based weighting while enabling segmentation-specific aggregation, improving saliency localization over Seg-Grad CAM. Evaluations on medical and non-medical datasets show more accurate saliency localization, though runtime and resolution constraints remain; the work motivates applying additional classification-based CAM transfers and richer quantitative comparisons for deeper evaluation of explainability in medical image segmentation.
Abstract
Convolutional neural networks (CNNs) achieve prevailing results in segmentation tasks nowadays and represent the state-of-the-art for image-based analysis. However, the understanding of the accurate decision-making process of a CNN is rather unknown. The research area of explainable artificial intelligence (xAI) primarily revolves around understanding and interpreting this black-box behavior. One way of interpreting a CNN is the use of class activation maps (CAMs) that represent heatmaps to indicate the importance of image areas for the prediction of the CNN. For classification tasks, a variety of CAM algorithms exist. But for segmentation tasks, only one CAM algorithm for the interpretation of the output of a CNN exist. We propose a transfer between existing classification- and segmentation-based methods for more detailed, explainable, and consistent results which show salient pixels in semantic segmentation tasks. The resulting Seg-HiRes-Grad CAM is an extension of the segmentation-based Seg-Grad CAM with the transfer to the classification-based HiRes CAM. Our method improves the previously-mentioned existing segmentation-based method by adjusting it to recently published classification-based methods. Especially for medical image segmentation, this transfer solves existing explainability disadvantages.
