Deep Generative Model-Based Generation of Synthetic Individual-Specific Brain MRI Segmentations
Ruijie Wang, Luca Rossetto, Susan Mérillat, Christina Röcke, Mike Martin, Abraham Bernstein
TL;DR
This work tackles the data scarcity of individual-specific brain MRI segmentations by introducing CSegSynth, a conditional deep generative model that synthesizes 3D WM, GM, and CSF segmentations from easily obtainable demographic, interview, and cognitive features. The approach combines unconditional pre-training on large MRI datasets (AOMIC ID1000) with conditional fine-tuning on CamCAN, using four architectures (VAE, GAN, LDM, and $\alpha$-GAN) and a dedicated conditional model (CSegSynth) to generate individual-specific segmentations. Empirical results show that CSegSynth achieves state-of-the-art quality across distributional similarity metrics (MMD, 2D/3D-FID, $|\Delta SSIM|$) and substantially improves volume-prediction accuracy for $WM$, $GM$, and $CSF$ with MAEs of $36.44$, $29.20$, and $35.51$ mL, respectively, outperforming conventional regression baselines. The model also enables novel neuroscience applications, including gradient-based feature importance analyses and hypothetical segmentation generation for aging and lifestyle scenarios, while acknowledging limitations such as limited training data and challenges in validating hypothetical trajectories. Overall, CSegSynth offers a principled, scalable path to individual-specific brain structure studies under data privacy and scarcity constraints, with publicly available code and data resources cited.
Abstract
To the best of our knowledge, all existing methods that can generate synthetic brain magnetic resonance imaging (MRI) scans for a specific individual require detailed structural or volumetric information about the individual's brain. However, such brain information is often scarce, expensive, and difficult to obtain. In this paper, we propose the first approach capable of generating synthetic brain MRI segmentations -- specifically, 3D white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF) segmentations -- for individuals using their easily obtainable and often readily available demographic, interview, and cognitive test information. Our approach features a novel deep generative model, CSegSynth, which outperforms existing prominent generative models, including conditional variational autoencoder (C-VAE), conditional generative adversarial network (C-GAN), and conditional latent diffusion model (C-LDM). We demonstrate the high quality of our synthetic segmentations through extensive evaluations. Also, in assessing the effectiveness of the individual-specific generation, we achieve superior volume prediction, with mean absolute errors of only 36.44mL, 29.20mL, and 35.51mL between the ground-truth WM, GM, and CSF volumes of test individuals and those volumes predicted based on generated individual-specific segmentations, respectively.
