Deformable Image Registration for Self-supervised Cardiac Phase Detection in Multi-View Multi-Disease Cardiac Magnetic Resonance Images
Sven Koehler, Sarah Kaye Mueller, Jonathan Kiekenap, Gerald Greil, Tarique Hussain, Samir Sarikouch, Florian André, Norbert Frey, Sandy Engelhardt
TL;DR
The paper addresses temporal misalignment in cine CMR by proposing a fully self-supervised deformable registration framework that yields a dense vector field $\phi_t$ and a one-dimensional motion descriptor $\alpha_t$, enabling detection of five keyframes: $ed$, $es$, $ms$, $pf$, and $md$. This approach operates in SAX and 4CH views and relies on a focus-point–based masking strategy, avoiding ECG labels. Evaluated on public multi-center datasets (MM2, MNMS, ACDC, GCN), it achieves substantial improvements in cyclic frame difference for $ed$ and $es$ compared with a volume-based baseline, with mean $cFD$ under 1.31 frames in SAX and 1.73 frames in LAX, and demonstrates generalization to rare congenital conditions. The work facilitates temporally aligned inter- and intra-patient cardiac dynamics analyses and supports extended phenotyping, with code released for reproducibility and further research.
Abstract
Cardiovascular magnetic resonance (CMR) is the gold standard for assessing cardiac function, but individual cardiac cycles complicate automatic temporal comparison or sub-phase analysis. Accurate cardiac keyframe detection can eliminate this problem. However, automatic methods solely derive end-systole (ES) and end-diastole (ED) frames from left ventricular volume curves, which do not provide a deeper insight into myocardial motion. We propose a self-supervised deep learning method detecting five keyframes in short-axis (SAX) and four-chamber long-axis (4CH) cine CMR. Initially, dense deformable registration fields are derived from the images and used to compute a 1D motion descriptor, which provides valuable insights into global cardiac contraction and relaxation patterns. From these characteristic curves, keyframes are determined using a simple set of rules. The method was independently evaluated for both views using three public, multicentre, multidisease datasets. M&Ms-2 (n=360) dataset was used for training and evaluation, and M&Ms (n=345) and ACDC (n=100) datasets for repeatability control. Furthermore, generalisability to patients with rare congenital heart defects was tested using the German Competence Network (GCN) dataset. Our self-supervised approach achieved improved detection accuracy by 30% - 51% for SAX and 11% - 47% for 4CH in ED and ES, as measured by cyclic frame difference (cFD), compared with the volume-based approach. We can detect ED and ES, as well as three additional keyframes throughout the cardiac cycle with a mean cFD below 1.31 frames for SAX and 1.73 for LAX. Our approach enables temporally aligned inter- and intra-patient analysis of cardiac dynamics, irrespective of cycle or phase lengths. GitHub repository: https://github.com/Cardio-AI/cmr-multi-view-phase-detection.git
