UniReg: Foundation Model for Controllable Medical Image Registration
Zi Li, Jianpeng Zhang, Tai Ma, Tony C. W. Mok, Yan-Jie Zhou, Zeli Chen, Xianghua Ye, Le Lu, Dakai Jin
TL;DR
UniReg introduces a foundation-style, controllable universal registration model that unifies multiple CT registration tasks under a single network by conditioning a dynamic deformation head on anatomical priors, inter-/intra-subject context, and instance features. A SAM-based shared backbone extracts robust anatomical representations, while a lightweight controller generates task-specific deformation kernels to produce a smooth, plausible deformation field for warping moving images toward fixed targets. The approach achieves state-of-the-art or near state-of-the-art accuracy across inter- and intra-subject registrations for 90 organs, with roughly half the training iterations of conventional task-specific models and strong generalization to unseen tasks via minimal conditioning. This yields a flexible, efficient solution for large-scale clinical workflows, enabling rapid adaptation to diverse registration scenarios with reduced computational demands.
Abstract
Learning-based medical image registration has achieved performance parity with conventional methods while demonstrating a substantial advantage in computational efficiency. However, learning-based registration approaches lack generalizability across diverse clinical scenarios, requiring the laborious development of multiple isolated networks for specific registration tasks, e.g., inter-/intra-subject registration or organ-specific alignment. % To overcome this limitation, we propose \textbf{UniReg}, the first interactive foundation model for medical image registration, which combines the precision advantages of task-specific learning methods with the generalization of traditional optimization methods. Our key innovation is a unified framework for diverse registration scenarios, achieved through a conditional deformation field estimation within a unified registration model. This is realized through a dynamic learning paradigm that explicitly encodes: (1) anatomical structure priors, (2) registration type constraints (inter/intra-subject), and (3) instance-specific features, enabling the generation of scenario-optimal deformation fields. % Through comprehensive experiments encompassing $90$ anatomical structures at different body regions, our UniReg model demonstrates comparable performance with contemporary state-of-the-art methodologies while achieving ~50\% reduction in required training iterations relative to the conventional learning-based paradigm. This optimization contributes to a significant reduction in computational resources, such as training time. Code and model will be available.
