MuseGNN: Forming Scalable, Convergent GNN Layers that Minimize a Sampling-Based Energy
Haitian Jiang, Renjie Liu, Zengfeng Huang, Yichuan Wang, Xiao Yan, Zhenkun Cai, Minjie Wang, David Wipf
TL;DR
MuseGNN tackles scaling of energy-based unfolded GNNs by embedding a sampling-based graph-regularized energy into the learning objective. It defines an offline-subgraph energy $\ ell_{\\text{muse}}(\\mathbb{Y}, M)$ and optimizes via alternating minimization over subgraph embeddings and shared node summaries, with an online mean estimator linking subgraphs. The authors establish convergence guarantees for both the upper-level training and the lower-level energy descent under specific settings and demonstrate stability and competitive accuracy on extremely large homogeneous graphs, including benchmarks exceeding 1 TB. This approach delivers scalable, interpretable GNN layers that retain expressive power and competitive performance without prohibitive memory requirements.
Abstract
Among the many variants of graph neural network (GNN) architectures capable of modeling data with cross-instance relations, an important subclass involves layers designed such that the forward pass iteratively reduces a graph-regularized energy function of interest. In this way, node embeddings produced at the output layer dually serve as both predictive features for solving downstream tasks (e.g., node classification) and energy function minimizers that inherit transparent, exploitable inductive biases and interpretability. However, scaling GNN architectures constructed in this way remains challenging, in part because the convergence of the forward pass may involve models with considerable depth. To tackle this limitation, we propose a sampling-based energy function and scalable GNN layers that iteratively reduce it, guided by convergence guarantees in certain settings. We also instantiate a full GNN architecture based on these designs, and the model achieves competitive accuracy and scalability when applied to the largest publicly-available node classification benchmark exceeding 1TB in size. Our source code is available at https://github.com/haitian-jiang/MuseGNN.
