Efficient and Concise Explanations for Object Detection with Gaussian-Class Activation Mapping Explainer
Quoc Khanh Nguyen, Truong Thanh Hung Nguyen, Vo Thanh Khang Nguyen, Van Binh Truong, Tuong Phan, Hung Cao
TL;DR
The paper addresses the need for fast and faithful explanations in object detection by introducing G-CAME, a Gaussian Class Activation Mapping Explainer. G-CAME extends CAM-based XAI to detectors by gradient-guided localization and Gaussian masking, producing concise saliency maps for a targeted object. It demonstrates superior localization fidelity and reduced noise compared with region-based methods like D-RISE, achieving around 0.5s per object and improved tiny-object bias on Faster-RCNN and YOLOX across MS-COCO 2017. The approach enables near real-time, targeted explanations with improved faithfulness and clearer visualizations for practical deployment.
Abstract
To address the challenges of providing quick and plausible explanations in Explainable AI (XAI) for object detection models, we introduce the Gaussian Class Activation Mapping Explainer (G-CAME). Our method efficiently generates concise saliency maps by utilizing activation maps from selected layers and applying a Gaussian kernel to emphasize critical image regions for the predicted object. Compared with other Region-based approaches, G-CAME significantly reduces explanation time to 0.5 seconds without compromising the quality. Our evaluation of G-CAME, using Faster-RCNN and YOLOX on the MS-COCO 2017 dataset, demonstrates its ability to offer highly plausible and faithful explanations, especially in reducing the bias on tiny object detection.
