Fast and Effective GNN Training through Sequences of Random Path Graphs
Francesco Bonchi, Claudio Gentile, Francesco Paolo Nerini, André Panisson, Fabio Vitale
TL;DR
This work tackles the scalability and generalization challenges of graph neural networks for node classification, especially under scarce labeling. It introduces Gern, a training framework that progressively refines GNN weights over a sequence of ultra-sparse Random Path Graphs (RPGs) derived from Random Spanning Trees (RSTs) and weighted by effective resistance, enabling fast training while preserving critical topology. By operating on RPGs and leveraging parallelized Approximate RSTs (A-RSTs), Gern mitigates over-squashing and over-smoothing and demonstrates improved test accuracy in small-data regimes across multiple benchmarks. Empirical results show substantial speedups and competitive or superior accuracy compared to baselines, with ablations validating the RPG-based linearization and the beneficial effect of ensemble RPGs on regularization and information propagation.
Abstract
We present GERN, a novel scalable framework for training GNNs in node classification tasks, based on effective resistance, a standard tool in spectral graph theory. Our method progressively refines the GNN weights on a sequence of random spanning trees suitably transformed into path graphs which, despite their simplicity, are shown to retain essential topological and node information of the original input graph. The sparse nature of these path graphs substantially lightens the computational burden of GNN training. This not only enhances scalability but also improves accuracy in subsequent test phases, especially under small training set regimes, which are of great practical importance, as in many real-world scenarios labels may be hard to obtain. In these settings, our framework yields very good results as it effectively counters the training deterioration caused by overfitting when the training set is small. Our method also addresses common issues like over-squashing and over-smoothing while avoiding under-reaching phenomena. Although our framework is flexible and can be deployed in several types of GNNs, in this paper we focus on graph convolutional networks and carry out an extensive experimental investigation on a number of real-world graph benchmarks, where we achieve simultaneous improvement of training speed and test accuracy over a wide pool of representative baselines.
