Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research

Rrubaa Panchendrarajan; Arkaitz Zubiaga

Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research

Rrubaa Panchendrarajan, Arkaitz Zubiaga

TL;DR

A comprehensive survey of existing multilingual claim detection research focused on multilingual misinformation and presents state-of-the-art multilingual claim detection research categorized into three key factors of the problem, verifiability, priority, and similarity.

Abstract

Automated fact-checking has drawn considerable attention over the past few decades due to the increase in the diffusion of misinformation on online platforms. This is often carried out as a sequence of tasks comprising (i) the detection of sentences circulating in online platforms which constitute claims needing verification, followed by (ii) the verification process of those claims. This survey focuses on the former, by discussing existing efforts towards detecting claims needing fact-checking, with a particular focus on multilingual data and methods. This is a challenging and fertile direction where existing methods are yet far from matching human performance due to the profoundly challenging nature of the issue. Especially, the dissemination of information across multiple social platforms, articulated in multiple languages and modalities demands more generalized solutions for combating misinformation. Focusing on multilingual misinformation, we present a comprehensive survey of existing multilingual claim detection research. We present state-of-the-art multilingual claim detection research categorized into three key factors of the problem, verifiability, priority, and similarity. Further, we present a detailed overview of the existing multilingual datasets along with the challenges and suggest possible future advancements.

Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research

TL;DR

Abstract

Paper Structure (33 sections, 5 figures, 6 tables)

This paper contains 33 sections, 5 figures, 6 tables.

Introduction
Background: Automated Fact-checking Pipeline
Definition of Claim
Cross-lingual Claim Detection
Transformer Models
Verifiable Claim Detection
Datasets
Cross-lingual Claim Detection
Multilingual Claim Detection
Monolingual Claim Detection
Evaluation
Discussion
Claim Prioritization
Datasets
Cross-lingual Claim Prioritization
...and 18 more sections

Figures (5)

Figure 1: Claim detection tasks.
Figure 2: Fact-checking Pipeline
Figure 3: Stages of a claim
Figure 4: Monolingual vs Multilingual vs Cross-lingual Claim Detection.
Figure 5: Statistics of CheckThat 2022 verifiable and check-worthy datasets.

Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research

TL;DR

Abstract

Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research

Authors

TL;DR

Abstract

Table of Contents

Figures (5)