NeuroMambaLLM: Dynamic Graph Learning of fMRI Functional Connectivity in Autistic Brains Using Mamba and Language Model Reasoning
Yasaman Torabi, Parsa Razmara, Hamed Ajorlou, Bardia Baraeinejad
TL;DR
NeuroMambaLLM addresses the limitations of static fMRI analysis by learning time-varying functional connectivity directly from raw BOLD signals and aligning these dynamic graphs with a frozen LLM through LoRA-tuned adapters. The method combines a dynamic latent graph encoder with a linear-time selective state-space backbone (Mamba) and a cross-modal alignment mechanism to enable both ASD classification and text-based reasoning about neural dynamics. Key contributions include: (i) an end-to-end dynamic graph learner that eschews fixed connectivity priors, (ii) efficient long-range temporal modelling suitable for long fMRI sequences, and (iii) brain–LLM alignment via brain-summary tokens that produce clinically grounded textual reports. The approach yields competitive accuracy on ABIDE with interpretable explanations, highlighting potential for clinically useful reasoning beyond black-box predictions, and sets the stage for multimodal extensions and broader generalization across datasets.
Abstract
Large Language Models (LLMs) have demonstrated strong semantic reasoning across multimodal domains. However, their integration with graph-based models of brain connectivity remains limited. In addition, most existing fMRI analysis methods rely on static Functional Connectivity (FC) representations, which obscure transient neural dynamics critical for neurodevelopmental disorders such as autism. Recent state-space approaches, including Mamba, model temporal structure efficiently, but are typically used as standalone feature extractors without explicit high-level reasoning. We propose NeuroMambaLLM, an end-to-end framework that integrates dynamic latent graph learning and selective state-space temporal modelling with LLMs. The proposed method learns the functional connectivity dynamically from raw Blood-Oxygen-Level-Dependent (BOLD) time series, replacing fixed correlation graphs with adaptive latent connectivity while suppressing motion-related artifacts and capturing long-range temporal dependencies. The resulting dynamic brain representations are projected into the embedding space of an LLM model, where the base language model remains frozen and lightweight low-rank adaptation (LoRA) modules are trained for parameter-efficient alignment. This design enables the LLM to perform both diagnostic classification and language-based reasoning, allowing it to analyze dynamic fMRI patterns and generate clinically meaningful textual reports.
