Fully Differentiable Correlation-driven 2D/3D Registration for X-ray to CT Image Fusion
Minheng Chen, Zhirun Zhang, Shuheng Gu, Zhangyang Ge, Youyong Kong
TL;DR
This work tackles rigid 2D/3D X-ray to CT registration by addressing interpretability, controllability, and limited capture range in fully differentiable methods. It introduces a correlation-driven network with a dual-branch CNN-Transformer encoder that decouples low-frequency global and high-frequency local features, coupled with a correlation-based decomposition loss and a training strategy that approximates a convex similarity function. The framework predicts a relative SE(3) pose and performs gradient-based iterative refinement, guided by a loss that combines $L_{appro}$ and $L_{decomp}$ with learnable uncertainty parameters and NCC-based feature decomposition. Evaluations on a spine-focused, simulated X-ray dataset demonstrate improved accuracy and robustness over CMA-ES baselines and existing differentiable methods, suggesting enhanced interpretability and capture range for clinical image-guided interventions. The approach has potential to improve real-time registration performance and reliability in fluoroscopic procedures.
Abstract
Image-based rigid 2D/3D registration is a critical technique for fluoroscopic guided surgical interventions. In recent years, some learning-based fully differentiable methods have produced beneficial outcomes while the process of feature extraction and gradient flow transmission still lack controllability and interpretability. To alleviate these problems, in this work, we propose a novel fully differentiable correlation-driven network using a dual-branch CNN-transformer encoder which enables the network to extract and separate low-frequency global features from high-frequency local features. A correlation-driven loss is further proposed for low-frequency feature and high-frequency feature decomposition based on embedded information. Besides, a training strategy that learns to approximate a convex-shape similarity function is applied in our work. We test our approach on a in-house datasetand show that it outperforms both existing fully differentiable learning-based registration approaches and the conventional optimization-based baseline.
