Table of Contents
Fetching ...

A Benchmark for Cross-Domain Argumentative Stance Classification on Social Media

Jiaqing Yuan, Ruijie Xi, Munindar P. Singh

TL;DR

This work proposes leveraging platform rules, readily available expert-curated content, and large language models to bypass the need for human annotation, and produces a multidomain benchmark comprising 4,498 topical claims and 30,961 arguments from three sources, spanning 21 domains.

Abstract

Argumentative stance classification plays a key role in identifying authors' viewpoints on specific topics. However, generating diverse pairs of argumentative sentences across various domains is challenging. Existing benchmarks often come from a single domain or focus on a limited set of topics. Additionally, manual annotation for accurate labeling is time-consuming and labor-intensive. To address these challenges, we propose leveraging platform rules, readily available expert-curated content, and large language models to bypass the need for human annotation. Our approach produces a multidomain benchmark comprising 4,498 topical claims and 30,961 arguments from three sources, spanning 21 domains. We benchmark the dataset in fully supervised, zero-shot, and few-shot settings, shedding light on the strengths and limitations of different methodologies. We release the dataset and code in this study at hidden for anonymity.

A Benchmark for Cross-Domain Argumentative Stance Classification on Social Media

TL;DR

This work proposes leveraging platform rules, readily available expert-curated content, and large language models to bypass the need for human annotation, and produces a multidomain benchmark comprising 4,498 topical claims and 30,961 arguments from three sources, spanning 21 domains.

Abstract

Argumentative stance classification plays a key role in identifying authors' viewpoints on specific topics. However, generating diverse pairs of argumentative sentences across various domains is challenging. Existing benchmarks often come from a single domain or focus on a limited set of topics. Additionally, manual annotation for accurate labeling is time-consuming and labor-intensive. To address these challenges, we propose leveraging platform rules, readily available expert-curated content, and large language models to bypass the need for human annotation. Our approach produces a multidomain benchmark comprising 4,498 topical claims and 30,961 arguments from three sources, spanning 21 domains. We benchmark the dataset in fully supervised, zero-shot, and few-shot settings, shedding light on the strengths and limitations of different methodologies. We release the dataset and code in this study at hidden for anonymity.

Paper Structure

This paper contains 48 sections, 3 figures, 14 tables.

Figures (3)

  • Figure 1: Domain distribution of the topics in our proposed dataset.
  • Figure 2: Lexical and semantic diversity scores across datasets and labels.
  • Figure 3: Performance of micro-F1 for BERT, T5, LLaMa on all partitions of the dataset with various training examples.