Semi-Supervised Diffusion Model for Brain Age Prediction
Ayodeji Ijishakin, Sophie Martin, Florence Townend, Federica Agosta, Edoardo Gioele Spinelli, Silvia Basaia, Paride Schito, Yuri Falzone, Massimo Filippi, James Cole, Andrea Malaspina
TL;DR
Brain age prediction in neurodegenerative contexts often suffers from data quality and rapid disease progression. The authors introduce a semi-supervised diffusion model built as a diffusion autoencoder conditioned on a semantic latent, with an age predictor mapping $\mathbf{z}_{\text{sem}}$ to age, achieving a test correlation of $r=0.83$ ($p<0.01$) and $R^2=0.65$ with $MAE=5$ years on clinical-grade data. Importantly, brain-PADs derived from this model significantly correlate with ALS survival ($r=0.24$, $p<0.05$), suggesting the approach captures aging-related neuroanatomy linked to outcomes. The method is competitive with non-generative baselines and demonstrates that diffusion-based representations can be robust to data quality, with potential clinical utility in prognosis and trial design.
Abstract
Brain age prediction models have succeeded in predicting clinical outcomes in neurodegenerative diseases, but can struggle with tasks involving faster progressing diseases and low quality data. To enhance their performance, we employ a semi-supervised diffusion model, obtaining a 0.83(p<0.01) correlation between chronological and predicted age on low quality T1w MR images. This was competitive with state-of-the-art non-generative methods. Furthermore, the predictions produced by our model were significantly associated with survival length (r=0.24, p<0.05) in Amyotrophic Lateral Sclerosis. Thus, our approach demonstrates the value of diffusion-based architectures for the task of brain age prediction.
