Attention-based Graph Neural Network for Semi-supervised Learning

Kiran K. Thekumparampil; Chong Wang; Sewoong Oh; Li-Jia Li

Attention-based Graph Neural Network for Semi-supervised Learning

Kiran K. Thekumparampil, Chong Wang, Sewoong Oh, Li-Jia Li

TL;DR

This work investigates semi-supervised node classification on graphs and reveals that propagation strength is the key driver of performance, enabling a lightweight attention-based approach. It introduces AGNN, which replaces dense nonlinear layers with dynamic, cosine-based attention over neighbors, controlled by per-layer scalar betas. Empirically, AGNN sets new state-of-the-art results on CiteSeer, Cora, and PubMed while using far fewer parameters and offering interpretability through attention weights. The results suggest that attention-guided propagation can yield both accurate and scalable graph-based learning, with enhanced insight into neighbor influence. The authors also demonstrate that simpler, more stable architectures can outperform deeper, more complex models in low-label regimes.

Abstract

Recently popularized graph neural networks achieve the state-of-the-art accuracy on a number of standard benchmark datasets for graph-based semi-supervised learning, improving significantly over existing approaches. These architectures alternate between a propagation layer that aggregates the hidden states of the local neighborhood and a fully-connected layer. Perhaps surprisingly, we show that a linear model, that removes all the intermediate fully-connected layers, is still able to achieve a performance comparable to the state-of-the-art models. This significantly reduces the number of parameters, which is critical for semi-supervised learning where number of labeled examples are small. This in turn allows a room for designing more innovative propagation layers. Based on this insight, we propose a novel graph neural network that removes all the intermediate fully-connected layers, and replaces the propagation layers with attention mechanisms that respect the structure of the graph. The attention mechanism allows us to learn a dynamic and adaptive local summary of the neighborhood to achieve more accurate predictions. In a number of experiments on benchmark citation networks datasets, we demonstrate that our approach outperforms competing methods. By examining the attention weights among neighbors, we show that our model provides some interesting insights on how neighbors influence each other.

Attention-based Graph Neural Network for Semi-supervised Learning

TL;DR

Abstract

Attention-based Graph Neural Network for Semi-supervised Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)