Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness
Hao Xu, Tengfei Xue, Jianan Fan, Dongnan Liu, Yuqian Chen, Fan Zhang, Carl-Fredrik Westin, Ron Kikinis, Lauren J. O'Donnell, Weidong Cai
TL;DR
Deformable medical image registration often relies solely on intensity similarity, risking inaccurate alignment in complex anatomies. The paper introduces a SAM-assisted registration framework that integrates SAM-generated segmentation masks, prototype contrastive learning and alignment, and contour-aware loss within a cross-fusion transformer to enforce both global semantic consistency and local boundary accuracy. It contributes explicit anatomical information injection, region-wise prototype learning, and contour-aware optimization, validated on Abdomen CT and ACDC MRI with state-of-the-art results. The approach enhances robustness and precision in challenging scenarios, enabling more reliable clinical image analysis.
Abstract
Medical image registration is a fundamental task in medical image analysis, aiming to establish spatial correspondences between paired images. However, existing unsupervised deformable registration methods rely solely on intensity-based similarity metrics, lacking explicit anatomical knowledge, which limits their accuracy and robustness. Vision foundation models, such as the Segment Anything Model (SAM), can generate high-quality segmentation masks that provide explicit anatomical structure knowledge, addressing the limitations of traditional methods that depend only on intensity similarity. Based on this, we propose a novel SAM-assisted registration framework incorporating prototype learning and contour awareness. The framework includes: (1) Explicit anatomical information injection, where SAM-generated segmentation masks are used as auxiliary inputs throughout training and testing to ensure the consistency of anatomical information; (2) Prototype learning, which leverages segmentation masks to extract prototype features and aligns prototypes to optimize semantic correspondences between images; and (3) Contour-aware loss, a contour-aware loss is designed that leverages the edges of segmentation masks to improve the model's performance in fine-grained deformation fields. Extensive experiments demonstrate that the proposed framework significantly outperforms existing methods across multiple datasets, particularly in challenging scenarios with complex anatomical structures and ambiguous boundaries. Our code is available at https://github.com/HaoXu0507/IPMI25-SAM-Assisted-Registration.
