Towards All-in-One Medical Image Re-Identification
Yuan Tian, Kaiyuan Ji, Rongzhao Zhang, Yankai Jiang, Chunyi Li, Xiaosong Wang, Guangtao Zhai
TL;DR
This work tackles medical image re-identification across diverse imaging modalities by introducing MaMI, an all-in-one MedReID framework. MaMI employs a Continuous Modality-based Parameter Adapter (ComPA) to dynamically convert a modality-agnostic backbone into modality-specific models at runtime, and it integrates rich medical priors from pre-trained medical foundation models through inter-image difference alignment to emphasize identity-related features. The approach is validated across 11 datasets and against numerous foundation models and large multi-modal language models, consistently outperforming baselines and demonstrating cross-modality and cross-domain generalization. Additionally, MaMI enables practical applications in history-augmented diagnosis and privacy protection by retrieving historical data and by identifying and removing identity cues from images, respectively. The work provides a thorough benchmark, detailed ablations, and publicly available code to facilitate adoption and further research in cross-modality MedReID and privacy-preserving medical imaging.
Abstract
Medical image re-identification (MedReID) is under-explored so far, despite its critical applications in personalized healthcare and privacy protection. In this paper, we introduce a thorough benchmark and a unified model for this problem. First, to handle various medical modalities, we propose a novel Continuous Modality-based Parameter Adapter (ComPA). ComPA condenses medical content into a continuous modality representation and dynamically adjusts the modality-agnostic model with modality-specific parameters at runtime. This allows a single model to adaptively learn and process diverse modality data. Furthermore, we integrate medical priors into our model by aligning it with a bag of pre-trained medical foundation models, in terms of the differential features. Compared to single-image feature, modeling the inter-image difference better fits the re-identification problem, which involves discriminating multiple images. We evaluate the proposed model against 25 foundation models and 8 large multi-modal language models across 11 image datasets, demonstrating consistently superior performance. Additionally, we deploy the proposed MedReID technique to two real-world applications, i.e., history-augmented personalized diagnosis and medical privacy protection. Codes and model is available at \href{https://github.com/tianyuan168326/All-in-One-MedReID-Pytorch}{https://github.com/tianyuan168326/All-in-One-MedReID-Pytorch}.
