Accurate and Scalable Graph Neural Networks via Message Invariance

Zhihao Shi; Jie Wang; Zhiwei Zhuang; Xize Liang; Bin Li; Feng Wu

TL;DR

This work addresses the computational blow-up that multi-layer GNNs incur in mini-batch training, where message passing from out-of-batch nodes (MP-OB) recursively depends on higher-order out-of-batch neighbors. It introduces TOP (topological compensation), a framework that exploits message invariance to replace the expensive MP-OB with fast in-batch message passing (MP-IB): a learned linear transformation R approximates out-of-batch messages from in-batch embeddings. The authors provide theoretical convergence guarantees and report experiments in which TOP reaches near full-batch accuracy with order-of-magnitude speedups and lower memory usage on large-scale graphs, outperforming existing subgraph, node-wise, and layer-wise sampling methods. The approach targets scalable GNN training on graphs with millions of nodes and billions of edges, and the reported results hold across multiple datasets and GNN backbones.

Abstract

Message passing-based graph neural networks (GNNs) have achieved great success in many real-world applications. For a sampled mini-batch of target nodes, the message passing process is divided into two parts: message passing between nodes within the batch (MP-IB) and message passing from nodes outside the batch to those within it (MP-OB). However, MP-OB recursively relies on higher-order out-of-batch neighbors, so its computational cost grows exponentially with the number of layers. Because of this neighbor explosion, the whole message passing must store most nodes and edges on the GPU, which makes many GNNs infeasible for large-scale graphs. To address this challenge, we propose an accurate and fast mini-batch approach for transductive learning on large graphs, namely topological compensation (TOP), which obtains the outputs of the whole message passing solely through MP-IB, without the costly MP-OB. The major pillar of TOP is a novel concept of message invariance, which defines message-invariant transformations that convert costly MP-OB into fast MP-IB. This ensures that the modified MP-IB has the same output as the whole message passing. Experiments demonstrate that TOP is faster than existing mini-batch methods by an order of magnitude on vast graphs (millions of nodes and billions of edges) with limited accuracy degradation.
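
The split into MP-IB and MP-OB, and the idea of folding MP-OB into a modified MP-IB, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the toy graph, the batch choice, the sum aggregator, and the helper names (`aggregate_full`, `split_aggregate`) are assumptions made for illustration. In particular, the out-of-batch embeddings are constructed to be an exact linear function of the in-batch embeddings (H_O = R H_B), so that the compensated in-batch aggregation reproduces the full result exactly; how TOP actually obtains R is not shown here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy undirected graph with self-loops: 8 nodes, 4-dimensional embeddings.
n, d = 8, 4
A = (rng.random((n, n)) < 0.4).astype(float)
A = np.maximum(A, A.T)
np.fill_diagonal(A, 1.0)

# Hypothetical mini-batch and its complement (the out-of-batch nodes).
batch = np.array([0, 1, 2])
out = np.setdiff1d(np.arange(n), batch)

# In-batch embeddings.
H_B = rng.standard_normal((len(batch), d))

# ASSUMPTION for illustration only: out-of-batch embeddings are an exact
# linear function of the in-batch embeddings, H_O = R @ H_B.
R = rng.standard_normal((len(out), len(batch)))
H_O = R @ H_B

H = np.empty((n, d))
H[batch] = H_B
H[out] = H_O


def aggregate_full(A, H, batch):
    """Sum aggregation for batch nodes over ALL their neighbors."""
    return A[batch] @ H


def split_aggregate(A, H, batch, out):
    """Split the same aggregation into its MP-IB and MP-OB parts."""
    mp_ib = A[np.ix_(batch, batch)] @ H[batch]  # messages from in-batch nodes
    mp_ob = A[np.ix_(batch, out)] @ H[out]      # messages from out-of-batch nodes
    return mp_ib, mp_ob


full = aggregate_full(A, H, batch)
mp_ib, mp_ob = split_aggregate(A, H, batch, out)

# Under the linear assumption, MP-OB folds into a compensated in-batch
# adjacency, so only in-batch embeddings are needed:
#   A_BO @ H_O = A_BO @ (R @ H_B) = (A_BO @ R) @ H_B
A_comp = A[np.ix_(batch, batch)] + A[np.ix_(batch, out)] @ R
compensated = A_comp @ H_B

print(np.allclose(full, mp_ib + mp_ob))  # the split is exact
print(np.allclose(full, compensated))    # exact only because H_O = R @ H_B here
```

On real graphs the linear relation would hold only approximately, so the compensated result would approximate, rather than equal, the full aggregation. The sketch also covers a single layer; the recursive cost of MP-OB described above arises when H_O itself must be computed from further out-of-batch neighbors.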
