Conversation-Based Multimodal Abuse Detection Through Text and Graph Embeddings

Noé Cecillon; Vincent Labatut; Richard Dufour

Conversation-Based Multimodal Abuse Detection Through Text and Graph Embeddings

Noé Cecillon, Vincent Labatut, Richard Dufour

TL;DR

This work tackles abuse detection in online conversations by integrating textual content with conversational structure through representation learning. It introduces two novel whole-graph embedding methods (WDA-SG2V and WDA-WSGCN) that incorporate edge weights, directions, signs, and vertex attributes, and systematically compares them with a broad suite of text and graph embeddings. Fusion experiments demonstrate that combining text and graph modalities yields the best performance, achieving up to $F$-measure = 87.06, highlighting the complementarity of content and context signals. The study also analyzes which discriminative features are captured by embeddings, providing interpretability insights and showing directions for future multimodal and dynamic-graph extensions with practical impact for scalable abuse detection.

Abstract

Abusive behavior is common on online social networks, and forces the hosts of such platforms to find new solutions to address this problem. Various methods have been proposed to automate this task in the past decade. Most of them rely on the exchanged content, but ignore the structure and dynamics of the conversation, which could provide some relevant information. In this article, we propose to use representation learning methods to automatically produce embeddings of this textual content and of the conversational graphs depicting message exchanges. While the latter could be enhanced by including additional information on top of the raw conversational structure, no method currently exists to learn whole-graph representations using simultaneously edge directions, weights, signs, and vertex attributes. We propose two such methods to fill this gap in the literature. We experiment with 5 textual and 13 graph embedding methods, and apply them to a dataset of online messages annotated for abuse detection. Our best results achieve an F -measure of 81.02 using text alone and 80.61 using graphs alone. We also combine both modalities of information (text and graphs) through three fusion strategies, and show that this strongly improves abuse detection performance, increasing the F -measure to 87.06. Finally, we identify which specific engineered features are captured by the embedding methods under consideration. These features have clear interpretations and help explain what information the representation learning methods deem discriminative.

Conversation-Based Multimodal Abuse Detection Through Text and Graph Embeddings

TL;DR

Abstract

Conversation-Based Multimodal Abuse Detection Through Text and Graph Embeddings

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)