Investigating the Timescales of Language Processing with EEG and Language Models
Davide Turco, Conor Houghton
TL;DR
This work analyzes the temporal dynamics of language processing by aligning word representations from a pre-trained transformer with EEG data via a Temporal Response Function (TRF). By comparing embeddings from the GPT-2 embedding layer ($l=0$) with a deep layer ($l=8$) and isolating part-of-speech (POS) information through Linear Discriminant Analysis (LDA), the study reveals layer-specific TRF patterns that distinguish lexical from compositional processing and shows POS-related neural influences at later time windows. The results demonstrate distinct spatial and temporal EEG signatures for different transformer representations and provide evidence that POS-derived features modulate neural activity in the $>200$ ms range. Overall, the work highlights EEG’s utility for probing language processing at fine timescales and helps bridge artificial language models with human neural responses.
Abstract
This study explores the temporal dynamics of language processing by examining the alignment between word representations from a pre-trained transformer-based language model, and EEG data. Using a Temporal Response Function (TRF) model, we investigate how neural activity corresponds to model representations across different layers, revealing insights into the interaction between artificial language models and brain responses during language comprehension. Our analysis reveals patterns in TRFs from distinct layers, highlighting varying contributions to lexical and compositional processing. Additionally, we used linear discriminant analysis (LDA) to isolate part-of-speech (POS) representations, offering insights into their influence on neural responses and the underlying mechanisms of syntactic processing. These findings underscore EEG's utility for probing language processing dynamics with high temporal resolution. By bridging artificial language models and neural activity, this study advances our understanding of their interaction at fine timescales.
