Uncertainty Estimation for Pretrained Medical Image Registration Models via Transformation Equivariance
Lin Tian, Xiaoling Hu, Juan Eugenio Iglesias
TL;DR
This work tackles the lack of uncertainty estimates in pretrained medical image registration by introducing an inference-time, model-agnostic framework based on transformation equivariance. By perturbing the source image with spatial transforms and analyzing the variability of the composed predictions, the method produces per-voxel uncertainty maps that correlate with true registration error without retraining. The uncertainty is theoretically decomposed into Intrinsic Spread and Bias Jitter, with concrete formulas linking the variance to the underlying error, and is validated across brain, cardiac, and abdominal datasets using three pretrained backbones. Empirically, the approach shows strong to moderate agreement with MC-dropout and meaningful case studies demonstrating its ability to flag anatomically inconsistent regions, enabling safer, risk-aware deployment of pretrained registration models in clinical and research settings.
Abstract
Accurate image registration is essential in many medical imaging applications, yet most deep registration networks provide little indication of when or where their predictions are unreliable. Existing uncertainty estimation approaches, such as Bayesian methods, ensembles, or MC-dropout, typically require architectural modifications or retraining, precluding their applicability to pretrained registration models. We propose an inference-time, model-agnostic uncertainty estimation framework that applies directly to any pretrained registration network. Our approach is grounded in the transformation equivariance property of image registration, which states that the underlying anatomical mapping should remain consistent under spatial perturbations of the input. Experiments across three pretrained registration models and four anatomical structures show that the resulting uncertainty maps consistently correlate with registration error and highlight unreliably aligned regions. This framework turns pretrained registration networks into risk-aware tools at test time, moving medical image registration closer to safe clinical and large-scale research deployment.
