Emergent Word Order Universals from Cognitively-Motivated Language Models
Tatsuki Kuribayashi, Ryo Ueda, Ryo Yoshida, Yohei Oseki, Ted Briscoe, Timothy Baldwin
TL;DR
The paper investigates whether cognitively plausible biases in language models can explain emergent word-order universals observed across languages. By training a range of standard and cognitively-motivated LMs on artificial languages with 64 word-order configurations derived from six binary parameters, the authors relate typological frequency to model-driven processing costs via perplexity. They find that syntactic biases, left-corner parsing strategies, and memory limitations generally yield higher alignment with attested word-order distributions than standard models, with global and local correlations supporting the link between cognitive biases and typology. However, certain human-like preferences, such as agent-first (SOV) tendencies, remain only partially explained, indicating the need for additional factors beyond surprisal and pointing to the usefulness and limits of cognitively-motivated LMs for modeling language universals. The work demonstrates a principled way to connect processing efficiency, predictability, and word-order typology, while outlining paths to refine artificial data and extend the framework to capture more human-like biases.
Abstract
The world's languages exhibit certain so-called typological or implicational universals; for example, Subject-Object-Verb (SOV) languages typically use postpositions. Explaining the source of such biases is a key goal of linguistics. We study word-order universals through a computational simulation with language models (LMs). Our experiments show that typologically-typical word orders tend to have lower perplexity estimated by LMs with cognitively plausible biases: syntactic biases, specific parsing strategies, and memory limitations. This suggests that the interplay of cognitive biases and predictability (perplexity) can explain many aspects of word-order universals. It also showcases the advantage of cognitively-motivated LMs, typically employed in cognitive modeling, in the simulation of language universals.
