An Effective, Robust and Fairness-aware Hate Speech Detection Framework

Guanyi Mou; Kyumin Lee

An Effective, Robust and Fairness-aware Hate Speech Detection Framework

Guanyi Mou, Kyumin Lee

TL;DR

Results show that the proposed data-augmented, fairness addressed, and uncertainty estimated novel framework outperforms eight state-of-the-art methods under both no attack scenario and various attack scenarios, indicating the effectiveness and robustness of the model.

Abstract

With the widespread online social networks, hate speeches are spreading faster and causing more damage than ever before. Existing hate speech detection methods have limitations in several aspects, such as handling data insufficiency, estimating model uncertainty, improving robustness against malicious attacks, and handling unintended bias (i.e., fairness). There is an urgent need for accurate, robust, and fair hate speech classification in online social networks. To bridge the gap, we design a data-augmented, fairness addressed, and uncertainty estimated novel framework. As parts of the framework, we propose Bidirectional Quaternion-Quasi-LSTM layers to balance effectiveness and efficiency. To build a generalized model, we combine five datasets collected from three platforms. Experiment results show that our model outperforms eight state-of-the-art methods under both no attack scenario and various attack scenarios, indicating the effectiveness and robustness of our model. We share our code along with combined dataset for better future research

An Effective, Robust and Fairness-aware Hate Speech Detection Framework

TL;DR

Abstract

Paper Structure (31 sections, 14 equations, 4 figures, 4 tables)

This paper contains 31 sections, 14 equations, 4 figures, 4 tables.

Introduction
Related Work
Hate Speech Detection
Data Augmentation
Fairness and Uncertainty Estimation
Preliminary
Quaternion Algebra and Networks
Quasi-RNNs
Our Proposed Framework
Data preprocessing
Generating CLP during training time
Deep Learning Classifier
Loss Function
Data
Dataset
...and 16 more sections

Figures (4)

Figure 1: The overview of our framework.
Figure 2: BiQQLSTM CLP VS. Top 5 baselines under avg. MCC of 5 seeds.
Figure 3: Our BiQQLSTM CLP's performance with each augmentation method.
Figure 4: NLG augmented data readability distribution before & after filtering.

An Effective, Robust and Fairness-aware Hate Speech Detection Framework

TL;DR

Abstract

An Effective, Robust and Fairness-aware Hate Speech Detection Framework

Authors

TL;DR

Abstract

Table of Contents

Figures (4)