Large Language Model-based FMRI Encoding of Language Functions for Subjects with Neurocognitive Disorder
Yuejiao Wang, Xianmin Gong, Lingwei Meng, Xixin Wu, Helen Meng
TL;DR
The paper addresses early detection of neurocognitive disorder-related language changes by applying LLM-based fMRI encoding to older adults, a population rarely studied with such models. It builds an encoding framework using LlaMA2-Cantonese context embeddings and voxel-wise ridge regression to derive brain scores that relate contextual language representations to fMRI signals. The study finds that higher MoCA scores correspond to stronger brain scores, with the strongest brain–cognition associations in the middle temporal gyrus (MTG) and related language ROIs, and shows modest or negligible gains from Cantonese-specific pretraining. This work demonstrates the potential of interpretable, language-model-driven fMRI encoders for early NCD detection and highlights the MTG, AG, and SFG as critical regions for language processing in aging and pathology.
Abstract
Functional magnetic resonance imaging (fMRI) is essential for developing encoding models that identify functional changes in language-related brain areas of individuals with Neurocognitive Disorders (NCD). While large language model (LLM)-based fMRI encoding has shown promise, existing studies predominantly focus on healthy, young adults, overlooking older NCD populations and cognitive level correlations. This paper explores language-related functional changes in older NCD adults using LLM-based fMRI encoding and brain scores, addressing current limitations. We analyze the correlation between brain scores and cognitive scores at both whole-brain and language-related ROI levels. Our findings reveal that higher cognitive abilities correspond to better brain scores, with correlations peaking in the middle temporal gyrus. This study highlights the potential of fMRI encoding models and brain scores for detecting early functional changes in NCD patients.
