DWARF: Disease-weighted network for attention map refinement
Haozhe Luo, Aurélie Pahud de Mortanges, Oana Inel, Abraham Bernstein, Mauricio Reyes
TL;DR
DWARF tackles interpretability in medical imaging by integrating clinicians into the training loop to refine attention maps via disease-specific guidance. It combines a pretrained Vision-Language Model with disease-specific segmentation heads and cyclic training to align explanations with findings. Across ChestX-Det, CheXlocalize, and Vindr-CXR, DWARF achieves state-of-the-art performance and more trustworthy attention maps, while clinician evaluations indicate higher confidence in AI-assisted classifications. The work also introduces Identity Enhanced Initialization to mitigate shortcut learning and discusses future directions for transferability and few-shot adaptation.
Abstract
The interpretability of deep learning is crucial for evaluating the reliability of medical imaging models and reducing the risks of inaccurate patient recommendations. This study addresses the "human out of the loop" and "trustworthiness" issues in medical image analysis by integrating medical professionals into the interpretability process. We propose a disease-weighted attention map refinement network (DWARF) that leverages expert feedback to enhance model relevance and accuracy. Our method employs cyclic training to iteratively improve diagnostic performance, generating precise and interpretable feature maps. Experimental results demonstrate significant improvements in interpretability and diagnostic accuracy across multiple medical imaging datasets. This approach fosters effective collaboration between AI systems and healthcare professionals, ultimately aiming to improve patient outcomes
