Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays
Zhichao Sun, Yuliang Gu, Yepeng Liu, Zerui Zhang, Zhou Zhao, Yongchao Xu
TL;DR
This work tackles unsupervised anomaly detection in chest X-rays by leveraging a frozen CLIP backbone with position-guided prompt learning (PPAD) to bridge domain gaps between pretraining data and clinical images. It introduces learnable text and image prompts that focus on lung subregions and a structure-preserving anomaly synthesis (SAS) to generate authentic synthetic lesions during training. PPAD delivers state-of-the-art performance across ZhangLab, CheXpert, and VinDr-CXR datasets, outperforming both CLIP-based methods and other prompt-learning approaches, with strong ablations confirming the effectiveness of regional prompts and SAS. The approach is computationally efficient, requiring only a tiny fraction of learnable parameters and demonstrating practical potential for scalable, few-shot anomaly detection in medical imaging.
Abstract
Anomaly detection in chest X-rays is a critical task. Most methods mainly model the distribution of normal images, and then regard significant deviation from normal distribution as anomaly. Recently, CLIP-based methods, pre-trained on a large number of medical images, have shown impressive performance on zero/few-shot downstream tasks. In this paper, we aim to explore the potential of CLIP-based methods for anomaly detection in chest X-rays. Considering the discrepancy between the CLIP pre-training data and the task-specific data, we propose a position-guided prompt learning method. Specifically, inspired by the fact that experts diagnose chest X-rays by carefully examining distinct lung regions, we propose learnable position-guided text and image prompts to adapt the task data to the frozen pre-trained CLIP-based model. To enhance the model's discriminative capability, we propose a novel structure-preserving anomaly synthesis method within chest x-rays during the training process. Extensive experiments on three datasets demonstrate that our proposed method outperforms some state-of-the-art methods. The code of our implementation is available at https://github.com/sunzc-sunny/PPAD.
