Scaling and context steer LLMs along the same computational path as the human brain
Joséphine Raugel, Stéphane d'Ascoli, Jérémy Rapin, Valentin Wyart, Jean-Rémi King
TL;DR
The paper investigates whether large language models and the human brain process natural language in a shared computational sequence. By aligning MEG responses to word onsets with activations from 22 LLMs across architectures, the study shows that earlier LLM layers align with early brain responses and deeper layers with later responses, a pattern that holds across transformers and non-transformer architectures. Temporal alignment intensifies with model size and context length, follows a logarithmic growth with diminishing returns, and is not driven solely by word predictability. A significant correlation between temporal and alignment scores suggests that brain-like sequential processing emerges under certain conditions, offering insights for brain-inspired AI and the role of memory and context in language comprehension.
Abstract
Recent studies suggest that the representations learned by large language models (LLMs) are partially aligned to those of the human brain. However, whether and why this alignment score arises from a similar sequence of computations remains elusive. In this study, we explore this question by examining temporally-resolved brain signals of participants listening to 10 hours of an audiobook. We study these neural dynamics jointly with a benchmark encompassing 22 LLMs varying in size and architecture type. Our analyses confirm that LLMs and the brain generate representations in a similar order: specifically, activations in the initial layers of LLMs tend to best align with early brain responses, while the deeper layers of LLMs tend to best align with later brain responses. This brain-LLM alignment is consistent across transformers and recurrent architectures. However, its emergence depends on both model size and context length. Overall, this study sheds light on the sequential nature of computations and the factors underlying the partial convergence between biological and artificial neural networks.
