ProvocationProbe: Instigating Hate Speech Dataset from Twitter

Abhay Kumar, Vigneshwaran Shankaran, Rajesh Sharma

TL;DR

We present an annotated dataset designed to explore what distinguishes instigating hate speech from general hate speech, and we highlight the difference between the two by identifying distinguishing features such as targeted identity attacks and reasons for hate.

Abstract

In recent years, online social media platforms have been flooded with hateful remarks involving racism, sexism, homophobia, and more. As a result, social media platforms have taken many measures to mitigate the spread of hate speech over the internet. One particular concept within the domain of hate speech is instigating hate, which involves provoking hatred against a particular community, race, colour, gender, religion, or ethnicity. In this work, we introduce ProvocationProbe, a dataset designed to explore what distinguishes instigating hate speech from general hate speech. For this study, we collected around twenty thousand tweets from Twitter, covering a total of nine global controversies. These controversies span various themes, including racism, politics, and religion. In this paper, (i) we present an annotated dataset built after a comprehensive examination of all the controversies, and (ii) we highlight the difference between hate speech and instigating hate speech by identifying distinguishing features, such as targeted identity attacks and reasons for hate.

Paper Structure

This paper contains 16 sections, 10 figures, and 5 tables.

Figures (10)

  • Figure 1: Distinction between Non Hateful Speech, Instigating Hate Speech, Non Instigating Hate Speech
  • Figure 2: WordClouds of Instigating Hate tweets for different controversies
  • Figure 3: George Floyd.
  • Figure 4: Andrew Cuomo's Sexual Harassment Allegations
  • Figure 5: Satan Shoes
  • ...and 5 more figures