Large Language Models Are Better Logical Fallacy Reasoners with Counterargument, Explanation, and Goal-Aware Prompt Formulation

Jiwon Jeong; Hyeju Jang; Hogun Park

Large Language Models Are Better Logical Fallacy Reasoners with Counterargument, Explanation, and Goal-Aware Prompt Formulation

Jiwon Jeong, Hyeju Jang, Hogun Park

TL;DR

This paper tackles the difficulty of detecting logical fallacies in natural language by introducing a prompt-engineered approach that injects implicit contextual information—Counterargument, Explanation, and Goal—into LLM prompts. It generates context-aware queries, ranks them by confidence, and uses this ranking to inform final classification, enabling both zero-shot and fine-tuned performance gains across five diverse fallacy datasets (29 types). The method yields substantial improvements over state-of-the-art baselines, with Macro-F1 gains up to 0.60 in zero-shot and up to 0.45 in supervised settings, and offers extensive analyses on calibration, query importance, and robustness. The work demonstrates that structured, multi-perspective prompts with confidence-based ranking can significantly enhance logical reasoning in LLMs and provides code for reproducibility and broader applicability.

Abstract

The advancement of Large Language Models (LLMs) has greatly improved our ability to process complex language. However, accurately detecting logical fallacies remains a significant challenge. This study presents a novel and effective prompt formulation approach for logical fallacy detection, applicable in both supervised (fine-tuned) and unsupervised (zero-shot) settings. Our method enriches input text incorporating implicit contextual information -- counterarguments, explanations, and goals -- which we query for validity within the context of the argument. We then rank these queries based on confidence scores to inform classification. We evaluate our approach across multiple datasets from 5 domains, covering 29 distinct fallacy types, using models from the GPT and LLaMA series. The results show substantial improvements over state-of-the-art models, with F1 score increases of up to 0.60 in zero-shot settings and up to 0.45 in fine-tuned models. Extensive analyses further illustrate why and how our method excels.

Large Language Models Are Better Logical Fallacy Reasoners with Counterargument, Explanation, and Goal-Aware Prompt Formulation

TL;DR

Abstract

Large Language Models Are Better Logical Fallacy Reasoners with Counterargument, Explanation, and Goal-Aware Prompt Formulation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)