Shaping Shared Languages: Human and Large Language Models' Inductive Biases in Emergent Communication
Tom Kouwenhoven, Max Peeperkorn, Roy de Kleijn, Tessa Verhoef
TL;DR
This work investigates how the inductive biases of humans and large language models (LLMs) shape emergent referential languages in a classic referential game. It extends prior emergent-communication work by comparing language evolution in Human-Human, LLM-LLM, and Human-LLM pairings, using a 27-meaning space and in-context learning for LLMs. The results show that structured, referentially grounded vocabularies emerge under all conditions, with human-optimized languages being more diverse and efficient, while LLM-optimized vocabularies tend toward degeneracy; crucially, Human-LLM collaboration yields vocabularies that are more human-like, suggesting cross-domain flexibility. The study highlights communicative success as a driving reward signal and argues for training LLMs with human interaction to maintain alignment and grounding in human communication. Together, these findings illuminate how LLM biases influence language dynamics and point to human-in-the-loop training as a pathway to robust human–machine communication.
Abstract
Languages are shaped by the inductive biases of their users. Using a classical referential game, we investigate how artificial languages evolve when optimised for inductive biases in humans and large language models (LLMs) via Human-Human, LLM-LLM and Human-LLM experiments. We show that referentially grounded vocabularies emerge that enable reliable communication in all conditions, even when humans \textit{and} LLMs collaborate. Comparisons between conditions reveal that languages optimised for LLMs subtly differ from those optimised for humans. Interestingly, interactions between humans and LLMs alleviate these differences and result in vocabularies more human-like than LLM-like. These findings advance our understanding of the role inductive biases in LLMs play in the dynamic nature of human language and contribute to maintaining alignment in human and machine communication. In particular, our work underscores the need to think of new LLM training methods that include human interaction and shows that using communicative success as a reward signal can be a fruitful, novel direction.
