A Brain Age Residual Biomarker (BARB): Leveraging MRI-Based Models to Detect Latent Health Conditions in U.S. Veterans
Shahrzad Jamshidi, Arthur Bousquet, Sugata Banerji, Mark F. Conneely, Bita Aslrousta
TL;DR
The paper addresses detecting latent health states via brain age residual biomarkers (BARBs) from MRI data. It trains four CNNs on 2D slices from T2-weighted FSE and T2-weighted FLAIR images at two brain locations and aggregates their predictions with a degree-3 polynomial regression to estimate brain age. On a veteran cohort of 1,220 individuals (1,104 with all four images), the approach achieves $R^2=0.816$ and MAE=$5.450$, with residual patterns that relate to the number of ICD-coded conditions, particularly in those aged $>49$. The work demonstrates BARB feasibility in a small, region-specific dataset and highlights the need for broader validation, richer health metrics, and multi-modal data to improve generalizability and clinical utility.
Abstract
Age prediction using brain imaging, such as MRIs, has achieved promising results, with several studies identifying the model's residual as a potential biomarker for chronic disease states. In this study, we developed a brain age predictive model using a dataset of 1,220 U.S. veterans (18--80 years) and convolutional neural networks (CNNs) trained on two-dimensional slices of axial T2-weighted fast spin-echo and T2-weighted fluid attenuated inversion recovery MRI images. The model, incorporating a degree-3 polynomial ensemble, achieved an $R^{2}$ of 0.816 on the testing set. Images were acquired at the level of the anterior commissure and the frontal horns of the lateral ventricles. Residual analysis was performed to assess its potential as a biomarker for five ICD-coded conditions: hypertension (HTN), diabetes mellitus (DM), mild traumatic brain injury (mTBI), illicit substance abuse/dependence (SAD), and alcohol abuse/dependence (AAD). Residuals grouped by the number of ICD-coded conditions demonstrated different trends that were statistically significant ($p = 0.002$), suggesting a relationship between disease states and predicted brain age. This association was particularly pronounced in patients over 49 years, where negative residuals (indicating advanced brain aging) correlated with the presence of multiple ICD codes. These findings support the potential of residuals as biomarkers for detecting latent health conditions.
