The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models
Kefan Yu, Qingcheng Zeng, Weihao Xuan, Wanxin Li, Jingyi Wu, Rob Voigt
TL;DR
This work introduces AltPrag, a principled dataset that tests pragmatic competence in LLMs by contrasting two equally plausible but pragmatically distinct continuations. Using an LLM-as-a-judge framework and GPT-4o as the reference annotator, the study tracks how pragmatic understanding emerges across pretraining, supervised fine-tuning, and preference optimization, evaluating a wide range of open-source models. Key findings show that base models already possess non-trivial pragmatic sensitivity, and that both scale and alignment stages (SFT and DPO) yield measurable gains, with cognitive-pragmatic reasoning strengthening earlier and sociopragmatic sensitivity increasing with DPO. The results highlight pragmatics as an emergent, compositional property of LLM training and offer guidance for aligning models with human communicative norms through staged training and robust evaluation. The work also demonstrates the critical role of pretraining data scale and quality in shaping pragmatic abilities, suggesting that foundational data choices can substantially influence downstream communicative behavior.
Abstract
Current large language models (LLMs) have demonstrated emerging capabilities in social intelligence tasks, including implicature resolution and theory-of-mind reasoning, both of which require substantial pragmatic understanding. However, how LLMs acquire this pragmatic competence throughout the training process remains poorly understood. In this work, we introduce ALTPRAG, a dataset grounded in the pragmatic concept of alternatives, to evaluate whether LLMs at different training stages can accurately infer nuanced speaker intentions. Each instance pairs two equally plausible yet pragmatically divergent continuations and requires the model to (i) infer the speaker's intended meaning and (ii) explain when and why a speaker would choose one utterance over its alternative, thus directly probing pragmatic competence through contrastive reasoning. We systematically evaluate 22 LLMs across 3 key training stages: after pre-training, supervised fine-tuning (SFT), and preference optimization, to examine the development of pragmatic competence. Our results show that even base models exhibit notable sensitivity to pragmatic cues, which improves consistently with increases in model and data scale. Additionally, SFT and RLHF contribute further gains, particularly in cognitive-pragmatic scenarios. These findings highlight pragmatic competence as an emergent and compositional property of LLM training and offer new insights for aligning models with human communicative norms.
