Identity Preserving Latent Diffusion for Brain Aging Modeling
Gexin Huang, Zhangsihao Yang, Yalin Wang, Guido Gerig, Mengwei Ren, Xiaoxiao Li
TL;DR
This work introduces IP-LDM, an age- and identity-conditioned latent diffusion model designed for longitudinal brain aging synthesis that preserves intra-subject identity. By combining an age encoder, a triplet-contrastive identity representation learner, and an identity-controlled latent diffusion transformation, the method enables continuous and realistic age progression from a single source brain image. Empirical results on elderly (OASIS-3) and infant (IBIS/Baby Brain) datasets show superior identity preservation and image quality, with quantitative gains in SSIM, PSNR, FID, KID, RMSE, and ARI over strong baselines. The approach offers a practical tool for studying brain aging dynamics and could impact neurodevelopmental and neurodegenerative research, though it is presently limited by dataset size and 2D processing, with future work aimed at 3D extensions and broader generalization.
Abstract
Structural and appearance changes in brain imaging over time are crucial indicators of neurodevelopment and neurodegeneration. The rapid advancement of large-scale generative models provides a promising backbone for modeling these complex global and local changes in brain images, such as transforming the age of a source image to a target age. However, current generative models, typically trained on independently and identically distributed (i.i.d.) data, may struggle to maintain intra-subject spatiotemporal consistency during transformations. We propose the Identity-Preserving Longitudinal Diffusion Model (IP-LDM), designed to accurately transform brain ages while preserving subject identity. Our approach involves first extracting the identity representation from the source image. Then, conditioned on the target age, the latent diffusion model learns to generate the age-transformed target image. To ensure consistency within the same subject over time, we regularize the identity representation using a triplet contrastive formulation. Our experiments on both elderly and infant brain datasets demonstrate that our model outperforms existing conditional generative models, producing realistic age transformations while preserving intra-subject identity.
