Cyclic 2.5D Perceptual Loss for Cross-Modal 3D Medical Image Synthesis: T1w MRI to Tau PET
Junho Moon, Symac Kim, Haejun Chung, Ikbeom Jang
TL;DR
This work tackles the problem of synthesizing tau PET images from T1-weighted MRI to support AD assessment by introducing a cyclic 2.5D perceptual loss that alternates plane-wise perceptual learning across axial, coronal, and sagittal views with progressively shrinking cycles. The method is combined with SSIM and MSE losses and paired with by-manufacturer SUVR standardization to mitigate inter-scanner variability. Evaluations across multiple architectures (including 3D U-Net, UNETR, SwinUNETR, CycleGAN, and Pix2Pix) show superior SSIM and ROI fidelity compared with 2.5D, 3D perceptual losses and existing PET-from-MRI approaches, indicating improved preservation of tau pathology. The approach offers a practical, noninvasive surrogate for tau burden that could aid pre-screening and triage in clinical workflows, with implications for broader access to tau-related assessments.
Abstract
There is a demand for medical image synthesis or translation to generate synthetic images of missing modalities from available data. This need stems from challenges such as restricted access to high-cost imaging devices, government regulations, or failure to follow up with patients or study participants. In medical imaging, preserving high-level semantic features is often more critical than achieving pixel-level accuracy. Perceptual loss functions are widely employed to train medical image synthesis or translation models, as they quantify differences in high-level image features using a pre-trained feature extraction network. While 3D and 2.5D perceptual losses are used in 3D medical image synthesis, they face challenges, such as the lack of pre-trained 3D models or difficulties in balancing loss reduction across different planes. In this work, we focus on synthesizing 3D tau PET images from 3D T1-weighted MR images. We propose a cyclic 2.5D perceptual loss that sequentially computes the 2D average perceptual loss for each of the axial, coronal, and sagittal planes over epochs, with the cycle duration gradually decreasing. Additionally, we process tau PET images using by-manufacturer standardization to enhance the preservation of high-SUVR regions indicative of tau pathology and mitigate SUVR variability caused by inter-manufacturer differences. We combine the proposed loss with SSIM and MSE losses and demonstrate its effectiveness in improving both quantitative and qualitative performance across various generative models, including U-Net, UNETR, SwinUNETR, CycleGAN, and Pix2Pix.
