How Entangled is Factuality and Deception in German?
Aswathy Velutharambath, Amelie Wührl, Roman Klinger
TL;DR
This work investigates the relationship between belief-based deception and factuality in German using the DeFaBel corpus, aiming to determine whether widely used deception cues generalize across languages and how deception interacts with fact verification. It evaluates traditional cue-based, transformer-based, and prompt-driven methods for deception detection, finding no robust cues in German and performance no better than random on the holdout set. The study further examines fact verification as an entailment task to assess whether deception and factuality confound predictions, revealing only modest and model-dependent effects, with non-factual and deceptive evidence posing particular challenges. Overall, the results highlight language- and context-specific limitations of current deception cues and models, underscoring the need for context-aware, multilingual, and explainable approaches in deception and verification tasks.
Abstract
The statement "The earth is flat" is factually inaccurate, but if someone truly believes and argues in its favor, it is not deceptive. Research on deception detection and fact checking often conflates factual accuracy with the truthfulness of statements. This assumption makes it difficult to (a) study subtle distinctions and interactions between the two and (b) gauge their effects on downstream tasks. The belief-based deception framework disentangles these properties by defining texts as deceptive when there is a mismatch between what people say and what they truly believe. In this study, we assess if presumed patterns of deception generalize to German language texts. We test the effectiveness of computational models in detecting deception using an established corpus of belief-based argumentation. Finally, we gauge the impact of deception on the downstream task of fact checking and explore if this property confounds verification models. Surprisingly, our analysis finds no correlation with established cues of deception. Previous work claimed that computational models can outperform humans in deception detection accuracy, however, our experiments show that both traditional and state-of-the-art models struggle with the task, performing no better than random guessing. For fact checking, we find that Natural Language Inference-based verification performs worse on non-factual and deceptive content, while prompting Large Language Models for the same task is less sensitive to these properties.
