Table of Contents
Fetching ...

Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research

Rrubaa Panchendrarajan, Arkaitz Zubiaga

TL;DR

A comprehensive survey of existing multilingual claim detection research focused on multilingual misinformation and presents state-of-the-art multilingual claim detection research categorized into three key factors of the problem, verifiability, priority, and similarity.

Abstract

Automated fact-checking has drawn considerable attention over the past few decades due to the increase in the diffusion of misinformation on online platforms. This is often carried out as a sequence of tasks comprising (i) the detection of sentences circulating in online platforms which constitute claims needing verification, followed by (ii) the verification process of those claims. This survey focuses on the former, by discussing existing efforts towards detecting claims needing fact-checking, with a particular focus on multilingual data and methods. This is a challenging and fertile direction where existing methods are yet far from matching human performance due to the profoundly challenging nature of the issue. Especially, the dissemination of information across multiple social platforms, articulated in multiple languages and modalities demands more generalized solutions for combating misinformation. Focusing on multilingual misinformation, we present a comprehensive survey of existing multilingual claim detection research. We present state-of-the-art multilingual claim detection research categorized into three key factors of the problem, verifiability, priority, and similarity. Further, we present a detailed overview of the existing multilingual datasets along with the challenges and suggest possible future advancements.

Claim Detection for Automated Fact-checking: A Survey on Monolingual, Multilingual and Cross-Lingual Research

TL;DR

A comprehensive survey of existing multilingual claim detection research focused on multilingual misinformation and presents state-of-the-art multilingual claim detection research categorized into three key factors of the problem, verifiability, priority, and similarity.

Abstract

Automated fact-checking has drawn considerable attention over the past few decades due to the increase in the diffusion of misinformation on online platforms. This is often carried out as a sequence of tasks comprising (i) the detection of sentences circulating in online platforms which constitute claims needing verification, followed by (ii) the verification process of those claims. This survey focuses on the former, by discussing existing efforts towards detecting claims needing fact-checking, with a particular focus on multilingual data and methods. This is a challenging and fertile direction where existing methods are yet far from matching human performance due to the profoundly challenging nature of the issue. Especially, the dissemination of information across multiple social platforms, articulated in multiple languages and modalities demands more generalized solutions for combating misinformation. Focusing on multilingual misinformation, we present a comprehensive survey of existing multilingual claim detection research. We present state-of-the-art multilingual claim detection research categorized into three key factors of the problem, verifiability, priority, and similarity. Further, we present a detailed overview of the existing multilingual datasets along with the challenges and suggest possible future advancements.
Paper Structure (33 sections, 5 figures, 6 tables)

This paper contains 33 sections, 5 figures, 6 tables.

Figures (5)

  • Figure 1: Claim detection tasks.
  • Figure 2: Fact-checking Pipeline
  • Figure 3: Stages of a claim
  • Figure 4: Monolingual vs Multilingual vs Cross-lingual Claim Detection.
  • Figure 5: Statistics of CheckThat 2022 verifiable and check-worthy datasets.