Do language models accommodate their users? A study of linguistic convergence
Terra Blevins, Susanne Schmalwieser, Benjamin Roth
TL;DR
The paper investigates whether language models linguistically converge to their users by grounding model completions in existing human dialogues. Using a synthetic paradigm across 16 models, three dialogue corpora, and multiple stylometric features, it shows that LLMs exhibit strong convergence to context, frequently surpassing random baselines and, in many cases, overconverging relative to human utterances. Convergence patterns vary by model family, training regime, and dataset, with pretrained models generally showing greater adaptation than instruction-tuned ones. The findings suggest that model convergence arises from pretraining dynamics rather than social goals, carrying implications for how we evaluate and deploy conversational AI and highlighting the need for user studies to understand perceptual effects on trust and interaction quality.
Abstract
While large language models (LLMs) are generally considered proficient in generating language, how similar their language usage is to that of humans remains understudied. In this paper, we test whether models exhibit linguistic convergence, a core pragmatic element of human language communication: do models adapt, or converge, to the linguistic patterns of their user? To answer this, we systematically compare model completions of existing dialogues to original human responses across sixteen language models, three dialogue corpora, and various stylometric features. We find that models strongly converge to the conversation's style, often significantly overfitting relative to the human baseline. While convergence patterns are often feature-specific, we observe consistent shifts in convergence across modeling settings, with instruction-tuned and larger models converging less than their pretrained and smaller counterparts. Given the differences in human and model convergence patterns, we hypothesize that the underlying mechanisms driving these behaviors are very different.
