Reason from Fallacy: Enhancing Large Language Models' Logical Reasoning through Logical Fallacy Understanding
Yanda Li, Dixuan Wang, Jiaqing Liang, Guochao Jiang, Qianyu He, Yanghua Xiao, Deqing Yang
TL;DR
This work tackles the gap in large language models' logical reasoning by focusing on logical fallacy understanding (LFU). It introduces LFUD, a GPT-4–driven dataset containing 4,020 LFU-focused QA instances across 12 fallacy types derived from 67 propositions, designed around five LFU tasks spanning WHAT, WHY, and HOW. Through fine-tuning experiments on LFUD (vs. baselines like LOGIC), the authors demonstrate significant improvements in LFU and general logical reasoning across multiple benchmarks, with clear evidence of cross-task transfer to Task5. The approach emphasizes data-driven LFU augmentation, robust validation, and insights into how task composition and fallacy diversity impact LFU learning, offering a practical resource for advancing causal, trustworthy reasoning in LLMs.
Abstract
Large Language Models (LLMs) have demonstrated good performance in many reasoning tasks, but they still struggle with some complicated reasoning tasks including logical reasoning. One non-negligible reason for LLMs' suboptimal performance on logical reasoning is their overlooking of understanding logical fallacies correctly. To evaluate LLMs' capability of logical fallacy understanding (LFU), we propose five concrete tasks from three cognitive dimensions of WHAT, WHY, and HOW in this paper. Towards these LFU tasks, we have successfully constructed a new dataset LFUD based on GPT-4 accompanied by a little human effort. Our extensive experiments justify that our LFUD can be used not only to evaluate LLMs' LFU capability, but also to fine-tune LLMs to obtain significantly enhanced performance on logical reasoning.
