MRI to PET Cross-Modality Translation using Globally and Locally Aware GAN (GLA-GAN) for Multi-Modal Diagnosis of Alzheimer's Disease
Apoorva Sikka, Skand Peri, Jitender Singh Virk, Usma Niyaz, Deepti R. Bathula
TL;DR
This work tackles cross-modality MRI-to-PET translation to aid Alzheimer's disease diagnosis by introducing GLA-GAN, a globally and locally aware multi-path GAN that combines a global generator with patch-based local generators. The model employs a composite loss (adversarial, voxel-wise L1, MS-SSIM, ROI) to preserve global structure, local detail, and region-specific information, achieving superior PET synthesis quality compared with state-of-the-art methods. Quantitative results show improved SSIM and reduced MAE, and the synthesized PET scans also enhance AD classification performance when used for data augmentation or completion of incomplete datasets. The authors further explore interpretability through visualization of internal features and interpolation analyses, while acknowledging limitations such as increased parameter count and the need for broader validation across disease stages like MCI. Overall, the approach offers a promising path toward practical, multi-modal diagnostic pipelines with reduced PET acquisition needs.
Abstract
Medical imaging datasets are inherently high dimensional with large variability and low sample sizes that limit the effectiveness of deep learning algorithms. Recently, generative adversarial networks (GANs) with the ability to synthesize realist images have shown great potential as an alternative to standard data augmentation techniques. Our work focuses on cross-modality synthesis of fluorodeoxyglucose~(FDG) Positron Emission Tomography~(PET) scans from structural Magnetic Resonance~(MR) images using generative models to facilitate multi-modal diagnosis of Alzheimer's disease (AD). Specifically, we propose a novel end-to-end, globally and locally aware image-to-image translation GAN (GLA-GAN) with a multi-path architecture that enforces both global structural integrity and fidelity to local details. We further supplement the standard adversarial loss with voxel-level intensity, multi-scale structural similarity (MS-SSIM) and region-of-interest (ROI) based loss components that reduce reconstruction error, enforce structural consistency at different scales and perceive variation in regional sensitivity to AD respectively. Experimental results demonstrate that our GLA-GAN not only generates synthesized FDG-PET scans with enhanced image quality but also superior clinical utility in improving AD diagnosis compared to state-of-the-art models. Finally, we attempt to interpret some of the internal units of the GAN that are closely related to this specific cross-modality generation task.
