Pragmatic Theories Enhance Understanding of Implied Meanings in LLMs
Takuma Sato, Seiya Kawano, Koichiro Yoshino
TL;DR
This work shows that injecting concise summaries of Gricean pragmatics and Relevance Theory into zero-shot prompting can bootstrap LLMs to better infer implied meanings without task-specific guidance, achieving up to 9.6% higher accuracy on pragmatic reasoning benchmarks. The proposed Gricean and Relevance Theory prompts guide in-context reasoning, often outperforming baselines and, for some models, reaching or exceeding human performance, with Gricean prompting showing the strongest and most consistent gains. Analyses by phenomenon reveal the approach is especially effective for irony and is generally robust across open and closed models, though metaphoric and maxim-like cases can remain challenging. Additional experiments rule out simple confounds, supporting a genuine effect of theory-informed prompting, while acknowledging limitations related to context richness, cross-language generalization, and the need for deeper mechanistic understanding. Overall, the method offers a simple, broadly applicable prompt engineering technique to enhance pragmatic understanding in LLMs, with implications for dialogue systems and higher-level tasks requiring implicit meaning interpretation.
Abstract
The ability to accurately interpret implied meanings plays a crucial role in human communication and language use, and language models are also expected to possess this capability. This study demonstrates that providing language models with pragmatic theories as prompts is an effective in-context learning approach for tasks to understand implied meanings. Specifically, we propose an approach in which an overview of pragmatic theories, such as Gricean pragmatics and Relevance Theory, is presented as a prompt to the language model, guiding it through a step-by-step reasoning process to derive a final interpretation. Experimental results showed that, compared to the baseline, which prompts intermediate reasoning without presenting pragmatic theories (0-shot Chain-of-Thought), our methods enabled language models to achieve up to 9.6\% higher scores on pragmatic reasoning tasks. Furthermore, we show that even without explaining the details of pragmatic theories, merely mentioning their names in the prompt leads to a certain performance improvement (around 1-3%) in larger models compared to the baseline.
