Part-based Quantitative Analysis for Heatmaps
Osman Tursun, Sinan Kalkan, Simon Denman, Sridha Sridharan, Clinton Fookes
TL;DR
This work introduces Part-based Quantitative Analysis for Heatmaps (PQAH), a quantitative framework that measures heatmap activation overlap with semantic object parts to produce fine-grained, objective XAI insights. By leveraging part annotations and a defined overlap metric (PH) based on $F_1$, PQAH generates per-part and background scores, aggregates them into quartiles $PH_{Q1}$, $PH_{Q2}$, and $PH_{Q3}$, and visualizes the results with boxplots. The authors couple PQAH with a pipeline for automated, end-user XAI reporting via large language models (e.g., GPT-4) and demonstrate the approach on PartImageNet and PASCAL-Part using multiple backbones (ResNet-50, VGG-16, ViT). They further illustrate a medical use case where PQAH-guided data augmentation reduces region-based biases and improves diagnostic accuracy, underscoring PQAH’s utility for both model debugging and user-friendly explanations. The work highlights that PQAH provides a scalable, granular lens for heatmap evaluation and model improvement, while acknowledging that human-centered interpretation remains a consideration and suggesting directions like specialized LLMs to enhance critique generation.
Abstract
Heatmaps have been instrumental in helping understand deep network decisions, and are a common approach for Explainable AI (XAI). While significant progress has been made in enhancing the informativeness and accessibility of heatmaps, heatmap analysis is typically very subjective and limited to domain experts. As such, developing automatic, scalable, and numerical analysis methods to make heatmap-based XAI more objective, end-user friendly, and cost-effective is vital. In addition, there is a need for comprehensive evaluation metrics to assess heatmap quality at a granular level.
