Disentangled Diffusion Autoencoder for Harmonization of Multi-site Neuroimaging Data
Ayodeji Ijishakin, Ana Lawry Aguila, Elizabeth Levitis, Ahmed Abdulaal, Andre Altmann, James Cole
TL;DR
This work addresses the challenge of multi-site neuroimaging harmonization by introducing the Disentangled Diffusion Autoencoder (DDAE), a diffusion-model framework that disentangles known covariates from unknown biological variance via separate latent representations. By conditioning the reverse diffusion process on both latents, DDAE can generate site-adjusted MR images that preserve biological variability while removing site-specific bias. In experiments across 7 sites on 4120 2D MR slices, DDAE achieves superior image quality (lowest FID), effective site removal, and robust preservation of age and sex information, while maintaining within-site variability, outperforming ComBat, a cVAE, and a Style-Encoding GAN. The approach demonstrates the potential of diffusion-based harmonization in neuroimaging and suggests extensions to 3D and vertex-wise data for broader applicability.
Abstract
Combining neuroimaging datasets from multiple sites and scanners can help increase statistical power and thus provide greater insight into subtle neuroanatomical effects. However, site-specific effects pose a challenge by potentially obscuring the biological signal and introducing unwanted variance. Existing harmonization techniques, which use statistical models to remove such effects, have been shown to incompletely remove site effects while also failing to preserve biological variability. More recently, generative models using GANs or autoencoder-based approaches, have been proposed for site adjustment. However, such methods are known for instability during training or blurry image generation. In recent years, diffusion models have become increasingly popular for their ability to generate high-quality synthetic images. In this work, we introduce the disentangled diffusion autoencoder (DDAE), a novel diffusion model designed for controlling specific aspects of an image. We apply the DDAE to the task of harmonizing MR images by generating high-quality site-adjusted images that preserve biological variability. We use data from 7 different sites and demonstrate the DDAE's superiority in generating high-resolution, harmonized 2D MR images over previous approaches. As far as we are aware, this work marks the first diffusion-based model for site adjustment of neuroimaging data.
