A Decision-Based Heterogenous Graph Attention Network for Multi-Class Fake News Detection
Batool Lakzaei, Mostafa Haghir Chehreghani, Alireza Bagheri
TL;DR
This work tackles multiclass fake news detection under semi-supervised constraints by modeling news items as a heterogeneous graph and enabling per-node, per-layer adaptive neighborhood selection. The core method, DHGAT, comprises a decision network that uses $GATv2$ to produce neighborhood-type probabilities $\boldsymbol{\rho}^l_v$ over a set of types $\Gamma$, and a representation network that updates node embeddings using the selected neighborhood type via a Straight-Through $Gumbel$-Softmax sampling $\gamma^l_v$. The final node embeddings are classified with an MLP using a loss that combines cross-entropy over the small labeled set and a semantic-distance term to align predicted scores with ground-truth scores on the six LIAR classes. Evaluated on the LIAR dataset with varying labeled data proportions, DHGAT achieves about a 4% accuracy improvement over strong baselines, demonstrating robustness to limited supervision and the effectiveness of dynamic, task-specific computation graphs for fake news detection.
Abstract
A promising tool for addressing fake news detection is Graph Neural Networks (GNNs). However, most existing GNN-based methods rely on binary classification, categorizing news as either real or fake. Additionally, traditional GNN models use a static neighborhood for each node, making them susceptible to issues like over-squashing. In this paper, we introduce a novel model named Decision-based Heterogeneous Graph Attention Network (DHGAT) for fake news detection in a semi-supervised setting. DHGAT effectively addresses the limitations of traditional GNNs by dynamically optimizing and selecting the neighborhood type for each node in every layer. It represents news data as a heterogeneous graph where nodes (news items) are connected by various types of edges. The architecture of DHGAT consists of a decision network that determines the optimal neighborhood type and a representation network that updates node embeddings based on this selection. As a result, each node learns an optimal and task-specific computational graph, enhancing both the accuracy and efficiency of the fake news detection process. We evaluate DHGAT on the LIAR dataset, a large and challenging dataset for multi-class fake news detection, which includes news items categorized into six classes. Our results demonstrate that DHGAT outperforms existing methods, improving accuracy by approximately 4% and showing robustness with limited labeled data.
