Simulating the Emergence of Differential Case Marking with Communicating Neural-Network Agents
Yuchen Lian, Arianna Bisazza, Tessa Verhoef
TL;DR
This study investigates the emergence of Differential Case Marking (DCM) using NeLLCom-X neural agents that first learn a miniature artificial language and then engage in communicative interactions. It shows that DCM-like patterns do not arise from learning alone but emerge when agents communicate, aligning with human studies that emphasize interaction pressures. The work contrasts Dominant and Neutral word-order languages to reveal how typicality and order influence marker usage and demonstrates that neural agents can reproduce human-like DOM effects under communication, while highlighting biases distinct from human learners. Overall, NeLLCom-X provides a scalable, controllable framework to complement experimental research on language evolution and the emergence of case marking.
Abstract
Differential Case Marking (DCM) refers to the phenomenon where grammatical case marking is applied selectively based on semantic, pragmatic, or other factors. The emergence of DCM has been studied in artificial language learning experiments with human participants, which were specifically aimed at disentangling the effects of learning from those of communication (Smith & Culbertson, 2020). Multi-agent reinforcement learning frameworks based on neural networks have gained significant interest to simulate the emergence of human-like linguistic phenomena. In this study, we employ such a framework in which agents first acquire an artificial language before engaging in communicative interactions, enabling direct comparisons to human result. Using a very generic communication optimization algorithm and neural-network learners that have no prior experience with language or semantic preferences, our results demonstrate that learning alone does not lead to DCM, but when agents communicate, differential use of markers arises. This supports Smith and Culbertson (2020)'s findings that highlight the critical role of communication in shaping DCM and showcases the potential of neural-agent models to complement experimental research on language evolution.
