Multi-resolution Guided 3D GANs for Medical Image Translation
Juhyung Ha, Jong Sung Park, David Crandall, Eleftherios Garyfallidis, Xuhong Zhang
TL;DR
The paper addresses cross-modality 3D medical image translation by introducing a multi-resolution guided GAN framework that employs a 3D-mDAUNet generator and a multi-resolution discriminator, optimized with voxel-wise and 2.5D perceptual losses. The loss design combines $L_G = \lambda_1 L_{voxel} + \lambda_2 L_{perception} + \lambda_3 L_{adv}$ with $\lambda_1=\lambda_2=1$ and $\lambda_3=0.0001$, and uses voxel-wise relativistic discrimination for stable training. Extensive experiments across MRI, CBCT, and CT datasets show state-of-the-art performance in both image quality and downstream segmentation tasks, often outperforming ResViT, PTNet3D, and Ea-GAN, with Dice scores reaching up to 0.880 and 0.836 in synthetic-to-real settings. The study highlights that traditional IQA metrics may not fully capture clinical utility, advocating multifaceted evaluation and demonstrating practical potential for reducing additional imaging acquisitions. The authors also provide open-source code at github.com/juhha/3D-mADUNet.
Abstract
Medical image translation is the process of converting from one imaging modality to another, in order to reduce the need for multiple image acquisitions from the same patient. This can enhance the efficiency of treatment by reducing the time, equipment, and labor needed. In this paper, we introduce a multi-resolution guided Generative Adversarial Network (GAN)-based framework for 3D medical image translation. Our framework uses a 3D multi-resolution Dense-Attention UNet (3D-mDAUNet) as the generator and a 3D multi-resolution UNet as the discriminator, optimized with a unique combination of loss functions including voxel-wise GAN loss and 2.5D perception loss. Our approach yields promising results in volumetric image quality assessment (IQA) across a variety of imaging modalities, body regions, and age groups, demonstrating its robustness. Furthermore, we propose a synthetic-to-real applicability assessment as an additional evaluation to assess the effectiveness of synthetic data in downstream applications such as segmentation. This comprehensive evaluation shows that our method produces synthetic medical images not only of high-quality but also potentially useful in clinical applications. Our code is available at github.com/juhha/3D-mADUNet.
