Pretext Task Adversarial Learning for Unpaired Low-field to Ultra High-field MRI Synthesis
Zhenxuan Zhang, Peiyuan Jing, Coraline Beitone, Jiahao Huang, Zhifan Gao, Guang Yang, Pete Lally
TL;DR
This paper addresses the scarcity of high-field MRI data by proposing an unpaired low-field to ultra high-field MRI synthesis method. The method, called Pretext Task Adversarial (PTA) learning, integrates Slice-wise Gap Perception (SGP) using contrastive learning, Local Structure Correction (LSC) via a pretext task of rotating and masking local blocks, and adversarial training guided by a pretext task to improve realism. The framework computes a composite objective $L_{all} = \lambda_1 \mathcal{L}_{syn} + \lambda_2 \mathcal{L}_{cycle} + \lambda_3 \mathcal{L}_{adv}$ with weights $\lambda_1=0.5$, $\lambda_2=0.2$, $\lambda_3=0.3$, and uses a discriminator to enforce realism, while a cycle-consistency constraint preserves anatomical fidelity. Extensive experiments demonstrate state-of-the-art performance on low-field to ultra high-field MRI synthesis, achieving $16.892$ for FID, $1.933$ for IS, and $0.324$ for MS-SSIM, and ablations plus MRI physicist evaluation support the contributions.
Abstract
Given the scarcity and cost of high-field MRI, the synthesis of high-field MRI from low-field MRI holds significant potential when there is limited data for training downstream tasks (e.g. segmentation). Low-field MRI often suffers from a reduced signal-to-noise ratio (SNR) and spatial resolution compared to high-field MRI. However, synthesizing high-field MRI data presents challenges. These involve aligning image features across domains while preserving anatomical accuracy and enhancing fine details. To address these challenges, we propose a Pretext Task Adversarial (PTA) learning framework for high-field MRI synthesis from low-field MRI data. The framework comprises three processes: (1) The slice-wise gap perception (SGP) network aligns the slice inconsistencies of low-field and high-field datasets based on contrastive learning. (2) The local structure correction (LSC) network extracts local structures by restoring the locally rotated and masked images. (3) The pretext task-guided adversarial training process introduces additional supervision and incorporates a discriminator to improve image realism. Extensive experiments on low-field to ultra high-field task demonstrate the effectiveness of our method, achieving state-of-the-art performance (16.892 in FID, 1.933 in IS, and 0.324 in MS-SSIM). This enables the generation of high-quality high-field-like MRI data from low-field MRI data to augment training datasets for downstream tasks. The code is available at: https://github.com/Zhenxuan-Zhang/PTA4Unpaired_HF_MRI_SYN.
