Large Language Models as Proxies for Theories of Human Linguistic Cognition
Imry Ziv, Nur Lan, Emmanuel Chemla, Roni Katzir
TL;DR
The paper investigates whether current large language models (LLMs) can serve as proxies for relatively linguistically-neutral theories of human linguistic cognition (HLC), contrasting the LLM Theory with the Proxy View and a concrete neutral framework $H_3$. It pursues two lines of inquiry: alignment with the stimulus, tested on across-the-board movement (ATB), parasitic gaps (PG), and that-trace effects (TTE); and cross-linguistic typology, tested via perturbations across multiple languages. The authors find that LLMs generally fail to acquire key linguistic patterns and sometimes even predict easier learning for typologically unattested variants. They argue that these results provide limited support for linguistically-neutral theories and offer a pragmatic critique of the Proxy View: making LLMs scientifically informative for HLC requires explicit theories and rigorous, detail-oriented evaluation. The work thus marks the limits of current LLMs as tools for cognitive linguistics and calls for deeper theoretical specification and methodological precision in future proxy-based analyses.
Abstract
We consider the possible role of current large language models (LLMs) in the study of human linguistic cognition. We focus on the use of such models as proxies for theories of cognition that are relatively linguistically-neutral in their representations and learning but differ from current LLMs in key ways. We illustrate this potential use of LLMs as proxies for theories of cognition in the context of two kinds of questions: (a) whether the target theory accounts for the acquisition of a given pattern from a given corpus; and (b) whether the target theory makes a given typologically-attested pattern easier to acquire than another, typologically-unattested pattern. For each of the two questions we show, building on recent literature, how current LLMs can potentially be of help, but we note that at present this help is quite limited.
