Diffuse-UDA: Addressing Unsupervised Domain Adaptation in Medical Image Segmentation with Appearance and Structure Aligned Diffusion Models
Haifan Gong, Yitao Wang, Yihan Wang, Jiashun Xiao, Xiang Wan, Haofeng Li
TL;DR
Diffuse-UDA tackles unsupervised domain adaptation for 3D medical image segmentation under cross-center and cross-modality domain shifts, where voxel-level labels are scarce. It introduces ASCPlus to improve target pseudo-label quality and a conditional diffusion model with deformable augmentation to synthesize high-quality image–mask pairs aligned to the target domain, then trains on a mix of source and generated data. Extensive experiments on FeTA fetal brain MRI and MM-WHS cardiac datasets show that Diffuse-UDA outperforms state-of-the-art UDA and SSL methods and approaches or even surpasses the performance of models trained with target-domain labels. The approach also provides synthetic data at scale and demonstrates improved feature alignment across domains, suggesting practical potential for fairer and more robust AI deployment in diverse clinical settings. Overall, Diffuse-UDA offers a scalable plug-and-play solution to bridge domain gaps in medical imaging and enable cross-center adoption of AI tools.
Abstract
The scarcity and complexity of voxel-level annotations in 3D medical imaging present significant challenges, particularly due to the domain gap between labeled datasets from well-resourced centers and unlabeled datasets from less-resourced centers. This disparity affects the fairness of artificial intelligence algorithms in healthcare. We introduce Diffuse-UDA, a novel method leveraging diffusion models to tackle Unsupervised Domain Adaptation (UDA) in medical image segmentation. Diffuse-UDA generates high-quality image-mask pairs with target domain characteristics and various structures, thereby enhancing UDA tasks. Initially, pseudo labels for target domain samples are generated. Subsequently, a specially tailored diffusion model, incorporating deformable augmentations, is trained on image-label or image-pseudo-label pairs from both domains. Finally, source domain labels guide the diffusion model to generate image-label pairs for the target domain. Comprehensive evaluations on several benchmarks demonstrate that Diffuse-UDA outperforms leading UDA and semi-supervised strategies, achieving performance close to or even surpassing the theoretical upper bound of models trained directly on target domain data. Diffuse-UDA offers a pathway to advance the development and deployment of AI systems in medical imaging, addressing disparities between healthcare environments. This approach enables the exploration of innovative AI-driven diagnostic tools, improves outcomes, saves time, and reduces human error.
