Opening the black box of language acquisition
Jérôme Michaud, Anna Jon-and
TL;DR
The paper presents a minimal, cognitively plausible architecture for language learning that replaces neural networks with sequence memory and hierarchical chunking, learned via error-correction temporal-difference reinforcement. Framed as a Markov Decision Process, the learner identifies sentence boundaries in input streams generated by probabilistic context-free grammars, using state–action values $Q(s,a)$ and a softmax policy to drive decisions. Across NVN baselines and increasingly complex artificial languages (MD, RelClause, ComplexNP), the model demonstrates (i) successful learning from small data, (ii) robust extraction and reuse of grammatical chunks, and (iii) emergent structures that parallel human language learning under memory constraints. The work highlights the potential sufficiency of simple, cognitively grounded mechanisms—sequence memory, chunking, and TD reinforcement—for grammar emergence, offering a tractable, interpretable alternative to opaque deep learning approaches and informing theories of human language acquisition.
Abstract
Recent advances in large language models using deep learning techniques have renewed interest on how languages can be learned from data. However, it is unclear whether or how these models represent grammatical information from the learned languages. In addition, the models must be pre-trained on large corpora before they can be used. In this work, we propose an alternative, more transparent and cognitively plausible architecture for learning language. Instead of using deep learning, our approach uses a minimal cognitive architecture based on sequence memory and chunking. The learning mechanism is based on the principles of reinforcement learning. We test our architecture on a number of natural-like toy languages. Results show that the model can learn these artificial languages from scratch and extract grammatical information that supports learning. Our study demonstrates the power of this simple architecture and stresses the importance of sequence memory as a key component of the language learning process. Since other animals do not seem to have a faithful sequence memory, this may explain why only humans have developed complex languages.
