CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds

Min-Hsuan Yeh; Ruyuan Wan; Ting-Hao 'Kenneth' Huang

CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds

Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao 'Kenneth' Huang

TL;DR

CoLoFa, the largest known logical fallacy dataset, containing 7,706 comments for 648 news articles, with each comment labeled for fallacy presence and type is introduced, showing that combining crowdsourcing and LLMs enables us to more effectively construct datasets for complex linguistic phenomena that crowd workers find challenging to produce on their own.

Abstract

Detecting logical fallacies in texts can help users spot argument flaws, but automating this detection is not easy. Manually annotating fallacies in large-scale, real-world text data to create datasets for developing and validating detection models is costly. This paper introduces CoCoLoFa, the largest known logical fallacy dataset, containing 7,706 comments for 648 news articles, with each comment labeled for fallacy presence and type. We recruited 143 crowd workers to write comments embodying specific fallacy types (e.g., slippery slope) in response to news articles. Recognizing the complexity of this writing task, we built an LLM-powered assistant into the workers' interface to aid in drafting and refining their comments. Experts rated the writing quality and labeling validity of CoCoLoFa as high and reliable. BERT-based models fine-tuned using CoCoLoFa achieved the highest fallacy detection (F1=0.86) and classification (F1=0.87) performance on its test set, outperforming the state-of-the-art LLMs. Our work shows that combining crowdsourcing and LLMs enables us to more effectively construct datasets for complex linguistic phenomena that crowd workers find challenging to produce on their own.

CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds

TL;DR

Abstract

Paper Structure (71 sections, 2 equations, 4 figures, 10 tables)

This paper contains 71 sections, 2 equations, 4 figures, 10 tables.

Introduction
Related Work
Logical Fallacy Datasets.
LLM-Assisted Crowdsourced Data Creation.
CoCoLoFa Dataset Construction
Selecting News Articles
Fallacy Types Included in CoCoLoFa
Collecting Comments with Specified Logical Fallacies from Crowd Workers Assisted by LLMs
Step 1: Read the News Article.
Step 2: Answer Attention-Check Questions about the News.
Step 3: Draft a Comment Containing the Specified Logical Fallacy and Revise with LLMs.
Rationale for the Workflow Design.
Implementation Details
Four Rounds of Data Collection.
Probability of Each Fallacy Type.
...and 56 more sections

Figures (4)

Figure 1: Examples from CoCoLoFa. For each news article, we hired crowd workers to form a thread of comment. Each worker was assigned to write a comment with a specific type of logical fallacy (or a neutral argument) in response to the article.
Figure 2: Different components in the task interface: A) The news article and comments, B) Questions for sanity check, C) Instruction of writing fallacious comments, D) Text box and the drop-down list for choosing the responded comment, E) GPT-4 generated guideline and example.
Figure 3: The confusion matrix of the annotation between two experts. Most of the disagreement happened when determining if a comment is fallacious or not.
Figure 4: The confusion matrix of the annotation agreement.

CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds

TL;DR

Abstract

CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds

Authors

TL;DR

Abstract

Table of Contents

Figures (4)