The Emergence of Grammar through Reinforcement Learning
Stephen Wechsler, James W. Shearer, Katrin Erk
TL;DR
This work shows how complex grammatical systems can emerge from reinforcement learning driven by speakers’ expressive preferences. By embedding message probabilities into Harley-Roth-Erev learning, the authors derive a fundamental model (Cat Walking in Grass) where semantic composition and interpretation arise as convergence results, and forgetting accelerates convergence while cross-speaker diversity remains compatible with a single emergent grammar. The paper extends this framework with similarity-based learning across verbs, recursion, and explicit forms, culminating in a Form Competition Model that accounts for grammaticalization and case-making in languages like English. The combination of analytic theorems and numerical simulations provides a rigorous usage-based foundation for grammar emergence, with two historical English case studies illustrating the model’s explanatory power and efficiency considerations. The approach offers a quantitative, testable path for connecting cognitive learning processes to historical and cross-linguistic grammar structure.
Abstract
The evolution of grammatical systems of syntactic and semantic composition is modeled here with a novel application of reinforcement learning theory. To test the functionalist thesis that speakers' expressive purposes shape their language, we include within the model a probability distribution over different messages that could be expressed in a given context. The proposed learning and production algorithm then breaks down language learning into a sequence of simple steps, such that each step benefits from the message probabilities. The results are presented in the form of numerical simulations of language histories and analytic proofs. The potential for applying these mathematical models to the study of natural language is illustrated with two case studies from the history of English.
