Multi-modal deformable image registration using untrained neural networks
Quang Luong Nhat Nguyen, Ruiming Cao, Laura Waller
TL;DR
The paper tackles the challenge of versatile image registration that across rigid/deformable and single/multi-modal data without task-specific models or ground-truth training data. It proposes untrained coordinate-based networks as implicit priors, using a motion network to generate a dense displacement field and an image network to store content, with $I_{trans} = f_{im}((x,y) + f_{mo}(x,y;\theta_{mo}); \theta_{im})$ and an $L_2$ objective. Model capacity is controlled via hash-encoded multi-resolution features and a coarse-to-fine strategy that first prioritizes motion alignment before refining image content, enabling robust performance across 2D and 3D datasets. Across Zurich 2D and Abdomen MR-CT tasks, the method outperforms baselines on single- and multi-modal registration, including deformable cases, illustrating its potential as a general, data-agnostic registration framework with practical implications for multimodal imaging and analysis.
Abstract
Image registration techniques usually assume that the images to be registered are of a certain type (e.g. single- vs. multi-modal, 2D vs. 3D, rigid vs. deformable) and there lacks a general method that can work for data under all conditions. We propose a registration method that utilizes neural networks for image representation. Our method uses untrained networks with limited representation capacity as an implicit prior to guide for a good registration. Unlike previous approaches that are specialized for specific data types, our method handles both rigid and non-rigid, as well as single- and multi-modal registration, without requiring changes to the model or objective function. We have performed a comprehensive evaluation study using a variety of datasets and demonstrated promising performance.
