Comparing human and LLM politeness strategies in free production
Haoran Zhao, Robert D. Hawkins
TL;DR
This work interrogates whether large language models can reproduce human politeness strategies and adapt them to context. By combining constrained, multiple-choice tasks with open-ended generation, it demonstrates that models with $ ext{≥}70$B parameters exhibit substantial pragmatic competence and can even be preferred by human evaluators when generating open-ended responses. However, linguistic analyses reveal systematic differences: LLMs overuse negative politeness (minimizing imposition) across contexts, potentially causing pragmatic misinterpretations and alignment concerns. The findings illuminate how training objectives shape subtle relational patterns in AI, with implications for safer, more interpretable human–AI interactions and for refining pragmatics-aligned instruction and evaluation. Overall, the work provides a nuanced view of pragmatic alignment in AI systems, balancing evidence of human-like competence with caution about underlying strategy biases.
Abstract
Polite speech poses a fundamental alignment challenge for large language models (LLMs). Humans deploy a rich repertoire of linguistic strategies to balance informational and social goals -- from positive approaches that build rapport (compliments, expressions of interest) to negative strategies that minimize imposition (hedging, indirectness). We investigate whether LLMs employ a similarly context-sensitive repertoire by comparing human and LLM responses in both constrained and open-ended production tasks. We find that larger models ($\ge$70B parameters) successfully replicate key preferences from the computational pragmatics literature, and human evaluators surprisingly prefer LLM-generated responses in open-ended contexts. However, further linguistic analyses reveal that models disproportionately rely on negative politeness strategies even in positive contexts, potentially leading to misinterpretations. While modern LLMs demonstrate an impressive handle on politeness strategies, these subtle differences raise important questions about pragmatic alignment in AI systems.
