Table of Contents
Fetching ...

Brain-like Functional Organization within Large Language Models

Haiyang Sun, Lin Zhao, Zihao Wu, Xiaohui Gao, Yutao Hu, Mengfei Zuo, Wei Zhang, Junwei Han, Tianming Liu, Xintao Hu

TL;DR

This research represents the first exploration of brain-like functional organization within LLMs, offering novel insights to inform the development of artificial general intelligence (AGI) with human brain principles.

Abstract

The human brain has long inspired the pursuit of artificial intelligence (AI). Recently, neuroimaging studies provide compelling evidence of alignment between the computational representation of artificial neural networks (ANNs) and the neural responses of the human brain to stimuli, suggesting that ANNs may employ brain-like information processing strategies. While such alignment has been observed across sensory modalities--visual, auditory, and linguistic--much of the focus has been on the behaviors of artificial neurons (ANs) at the population level, leaving the functional organization of individual ANs that facilitates such brain-like processes largely unexplored. In this study, we bridge this gap by directly coupling sub-groups of artificial neurons with functional brain networks (FBNs), the foundational organizational structure of the human brain. Specifically, we extract representative patterns from temporal responses of ANs in large language models (LLMs), and use them as fixed regressors to construct voxel-wise encoding models to predict brain activity recorded by functional magnetic resonance imaging (fMRI). This framework links the AN sub-groups to FBNs, enabling the delineation of brain-like functional organization within LLMs. Our findings reveal that LLMs (BERT and Llama 1-3) exhibit brain-like functional architecture, with sub-groups of artificial neurons mirroring the organizational patterns of well-established FBNs. Notably, the brain-like functional organization of LLMs evolves with the increased sophistication and capability, achieving an improved balance between the diversity of computational behaviors and the consistency of functional specializations. This research represents the first exploration of brain-like functional organization within LLMs, offering novel insights to inform the development of artificial general intelligence (AGI) with human brain principles.

Brain-like Functional Organization within Large Language Models

TL;DR

This research represents the first exploration of brain-like functional organization within LLMs, offering novel insights to inform the development of artificial general intelligence (AGI) with human brain principles.

Abstract

The human brain has long inspired the pursuit of artificial intelligence (AI). Recently, neuroimaging studies provide compelling evidence of alignment between the computational representation of artificial neural networks (ANNs) and the neural responses of the human brain to stimuli, suggesting that ANNs may employ brain-like information processing strategies. While such alignment has been observed across sensory modalities--visual, auditory, and linguistic--much of the focus has been on the behaviors of artificial neurons (ANs) at the population level, leaving the functional organization of individual ANs that facilitates such brain-like processes largely unexplored. In this study, we bridge this gap by directly coupling sub-groups of artificial neurons with functional brain networks (FBNs), the foundational organizational structure of the human brain. Specifically, we extract representative patterns from temporal responses of ANs in large language models (LLMs), and use them as fixed regressors to construct voxel-wise encoding models to predict brain activity recorded by functional magnetic resonance imaging (fMRI). This framework links the AN sub-groups to FBNs, enabling the delineation of brain-like functional organization within LLMs. Our findings reveal that LLMs (BERT and Llama 1-3) exhibit brain-like functional architecture, with sub-groups of artificial neurons mirroring the organizational patterns of well-established FBNs. Notably, the brain-like functional organization of LLMs evolves with the increased sophistication and capability, achieving an improved balance between the diversity of computational behaviors and the consistency of functional specializations. This research represents the first exploration of brain-like functional organization within LLMs, offering novel insights to inform the development of artificial general intelligence (AGI) with human brain principles.

Paper Structure

This paper contains 18 sections, 2 equations, 15 figures.

Figures (15)

  • Figure 1: The study overview. We learn representative patterns $\mathbf{D_\textit{AN}}$ (b) from the temporal responses of ANs in LLMs (a) and use $\mathbf{D_\textit{AN}}$ as regressors to reconstruct fMRI brain activity recorded by fMRI (c). Atoms in $\mathbf{D_\textit{AN}}$ selectively activate specific brain areas/networks.
  • Figure 2: The $\mathrm{R^2}$ values in the sparse representation of temporal responses of ANs (a) and in the sparse reconstruction of fMRI activity (b) using $\mathbf{D_\textit{AN}}$. (c-f): The spatial distribution of the $\mathrm{R^2}s$ in BERT and Llama family visualized on the cortical surface, respectively.
  • Figure 3: (a) Two exemplar brain maps corresponding to the atom #9 and atom #17 in Llama3. (b) Automatic brain network labelling of atom #9, illustrating the activation of the language, salienceA and salienceB networks, and the deactivation of the lateral visual (LatVis) cortex.
  • Figure 4: (a) The number of brain maps associated with different FBNs. (b) The distribution of the brain maps associated with different number of FBNs.
  • Figure 5: The detailed FBN components involved in brain maps. The color-coding represents the Dice coefficients obtained from FBN labeling, which measures the spatial overlap between brain maps and FBNs. A negative Dice value here indicates deactivation of FBNs in a given brain map. Braces are used to highlight brain maps that have identical FBN labels.
  • ...and 10 more figures