Graph Neural Networks on Discriminative Graphs of Words

Yassine Abbahaddou; Johannes F. Lutzeyer; Michalis Vazirgiannis

Graph Neural Networks on Discriminative Graphs of Words

Yassine Abbahaddou, Johannes F. Lutzeyer, Michalis Vazirgiannis

TL;DR

This work explores a new Discriminative Graph of Words Graph Neural Network (DGoW-GNN) approach encapsulating both a novel discriminative graph construction and model to classify text, and proposes a new model for the graph-based classification of text, which combines a GNN and a sequence model.

Abstract

In light of the recent success of Graph Neural Networks (GNNs) and their ability to perform inference on complex data structures, many studies apply GNNs to the task of text classification. In most previous methods, a heterogeneous graph, containing both word and document nodes, is constructed using the entire corpus and a GNN is used to classify document nodes. In this work, we explore a new Discriminative Graph of Words Graph Neural Network (DGoW-GNN) approach encapsulating both a novel discriminative graph construction and model to classify text. In our graph construction, containing only word nodes and no document nodes, we split the training corpus into disconnected subgraphs according to their labels and weight edges by the pointwise mutual information of the represented words. Our graph construction, for which we provide theoretical motivation, allows us to reformulate the task of text classification as the task of walk classification. We also propose a new model for the graph-based classification of text, which combines a GNN and a sequence model. We evaluate our approach on seven benchmark datasets and find that it is outperformed by several state-of-the-art baseline models. We analyse reasons for this performance difference and hypothesise under which conditions it is likely to change.

Graph Neural Networks on Discriminative Graphs of Words

TL;DR

Abstract

Graph Neural Networks on Discriminative Graphs of Words

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (3)