Dual-domain Multi-path Self-supervised Diffusion Model for Accelerated MRI Reconstruction
Yuxuan Zhang, Jinkui Hao, Bo Zhou
TL;DR
This work tackles the challenge of long MRI acquisition times by introducing DMSM, a Dual-domain Multi-path Self-supervised Diffusion Model. It combines dual-domain self-supervised training with a lightweight LHAN-DC reconstruction backbone and a multi-path inference strategy to produce high-quality reconstructions from undersampled data while providing uncertainty estimates. The method achieves state-of-the-art performance on fastMRI and IXI across multiple contrasts and acceleration factors, approaching fully supervised results with a compact ~0.8M parameter footprint and without requiring fully sampled training data. The uncertainty maps derived from multi-path reconstructions offer clinically interpretable guidance, potentially improving diagnostic confidence and workflow efficiency.
Abstract
Magnetic resonance imaging (MRI) is a vital diagnostic tool, but its inherently long acquisition times reduce clinical efficiency and patient comfort. Recent advancements in deep learning, particularly diffusion models, have improved accelerated MRI reconstruction. However, existing diffusion models' training often relies on fully sampled data, models incur high computational costs, and often lack uncertainty estimation, limiting their clinical applicability. To overcome these challenges, we propose a novel framework, called Dual-domain Multi-path Self-supervised Diffusion Model (DMSM), that integrates a self-supervised dual-domain diffusion model training scheme, a lightweight hybrid attention network for the reconstruction diffusion model, and a multi-path inference strategy, to enhance reconstruction accuracy, efficiency, and explainability. Unlike traditional diffusion-based models, DMSM eliminates the dependency on training from fully sampled data, making it more practical for real-world clinical settings. We evaluated DMSM on two human MRI datasets, demonstrating that it achieves favorable performance over several supervised and self-supervised baselines, particularly in preserving fine anatomical structures and suppressing artifacts under high acceleration factors. Additionally, our model generates uncertainty maps that correlate reasonably well with reconstruction errors, offering valuable clinically interpretable guidance and potentially enhancing diagnostic confidence.
