LLMs Struggle to Reject False Presuppositions when Misinformation Stakes are High
Judith Sieker, Clara Lachenmaier, Sina Zarrieß
TL;DR
This work investigates whether large language models (LLMs) reliably reject false presuppositions in political contexts, using a controlled German dataset and three models (GPT-4-o, LLaMa-3-8B, Mistral-7B-v03). By systematically varying presupposition trigger types, embedding contexts, scenario probabilities, and political distance, the study reveals that all models struggle to reject false presuppositions, with GPT performing best (84.08% rejection) but far from perfect. Embedding context strongly affects LLaMa but not GPT, while scenario probability universally drives responses; political distance disproportionately influences GPT. The findings highlight the utility of linguistic presupposition analysis for auditing LLMs against misinformation and underscore the need for targeted improvements to ensure more robust handling of embedded false information in real-world political discourse.
Abstract
This paper examines how LLMs handle false presuppositions and whether certain linguistic factors influence their responses to falsely presupposed content. Presuppositions subtly introduce information as given, making them highly effective at embedding disputable or false information. This raises concerns about whether LLMs, like humans, may fail to detect and correct misleading assumptions introduced as false presuppositions, even when the stakes of misinformation are high. Using a systematic approach based on linguistic presupposition analysis, we investigate the conditions under which LLMs are more or less sensitive to adopt or reject false presuppositions. Focusing on political contexts, we examine how factors like linguistic construction, political party, and scenario probability impact the recognition of false presuppositions. We conduct experiments with a newly created dataset and examine three LLMs: OpenAI's GPT-4-o, Meta's LLama-3-8B, and MistralAI's Mistral-7B-v03. Our results show that the models struggle to recognize false presuppositions, with performance varying by condition. This study highlights that linguistic presupposition analysis is a valuable tool for uncovering the reinforcement of political misinformation in LLM responses.
