Scale-Aware Contrastive Reverse Distillation for Unsupervised Medical Anomaly Detection
Chunlei Li, Yilei Shi, Jingliang Hu, Xiao Xiang Zhu, Lichao Mou
TL;DR
We address unsupervised medical anomaly detection by introducing a scale-aware contrastive reverse distillation framework that uses a clean and a noisy teacher alongside a student decoder. A scale adaptation module learns per-scale weights to handle anomaly size variation, while simplex-noise-based anomaly synthesis drives discriminative, out-of-normal representations. Empirical results on RSNA, Brain Tumor MRI, and ISIC 2018 show state-of-the-art performance across AUC, F1, and accuracy, with ablations confirming the contributions of CRD and scale weighting. The approach offers practical benefits for medical imaging, including efficient inference and robust handling of multi-scale anomalies, and provides a codebase for reproducibility.
Abstract
Unsupervised anomaly detection using deep learning has garnered significant research attention due to its broad applicability, particularly in medical imaging where labeled anomalous data are scarce. While earlier approaches leverage generative models like autoencoders and generative adversarial networks (GANs), they often fall short due to overgeneralization. Recent methods explore various strategies, including memory banks, normalizing flows, self-supervised learning, and knowledge distillation, to enhance discrimination. Among these, knowledge distillation, particularly reverse distillation, has shown promise. Following this paradigm, we propose a novel scale-aware contrastive reverse distillation model that addresses two key limitations of existing reverse distillation methods: insufficient feature discriminability and inability to handle anomaly scale variations. Specifically, we introduce a contrastive student-teacher learning approach to derive more discriminative representations by generating and exploring out-of-normal distributions. Further, we design a scale adaptation mechanism to softly weight contrastive distillation losses at different scales to account for the scale variation issue. Extensive experiments on benchmark datasets demonstrate state-of-the-art performance, validating the efficacy of the proposed method. Code is available at https://github.com/MedAITech/SCRD4AD.
