Navigating Brain Language Representations: A Comparative Analysis of Neural Language Models and Psychologically Plausible Models
Yunhao Zhang, Shaonan Wang, Xinyi Dong, Jiajun Yu, Chengqing Zong
TL;DR
The paper investigates whether neural language models (NLMs) genuinely best predict brain language representations or if psychologically plausible models (PPMs) can do so more effectively. It compares NLMs and PPMs across eight multi-modal cognitive datasets in English and Chinese, at word- and discourse-levels, using fMRI and eye-tracking data and region/voxel-level brain encoding analyses. The results show that PPMs, especially Embodied-Based Models (EBMs) and Network-Topological Models (NTMs), outperform NLMs in brain encoding across modalities and languages, challenging the view of universal NLM supremacy. These findings highlight distinct brain-region encoding patterns linked to specific representation types and suggest cross-language generality, informing model choice for cognitive neuroscience and language processing research.
Abstract
Neural language models, particularly large-scale ones, have been consistently proven to be most effective in predicting brain neural activity across a range of studies. However, previous research overlooked the comparison of these models with psychologically plausible ones. Moreover, evaluations were reliant on limited, single-modality, and English cognitive datasets. To address these questions, we conducted an analysis comparing encoding performance of various neural language models and psychologically plausible models. Our study utilized extensive multi-modal cognitive datasets, examining bilingual word and discourse levels. Surprisingly, our findings revealed that psychologically plausible models outperformed neural language models across diverse contexts, encompassing different modalities such as fMRI and eye-tracking, and spanning languages from English to Chinese. Among psychologically plausible models, the one incorporating embodied information emerged as particularly exceptional. This model demonstrated superior performance at both word and discourse levels, exhibiting robust prediction of brain activation across numerous regions in both English and Chinese.
