Unleash Graph Neural Networks from Heavy Tuning
Lequan Lin, Dai Shi, Andi Han, Zhiyong Wang, Junbin Gao
TL;DR
The paper tackles the substantial cost and overfitting risk of hyperparameter tuning in graph neural networks by introducing GNN-Diff, a graph-conditioned latent diffusion framework that learns from checkpoints saved during a light coarse search to generate high-performing GNN parameters. GNN-Diff leverages a graph autoencoder to encode X and A into a graph condition and a latent DDPM to sample latent parameter representations, which are reconstructed into full GNN weights via a parameter autoencoder. Across node classification benchmarks on both homophilic and heterophilic graphs, GNN-Diff often matches or surpasses grid search while dramatically reducing computation, and ablations confirm the value of graph conditioning for robust parameter generation. This approach shifts GNN optimization from exhaustive search toward exploring the population distribution of good parameters, enabling faster deployment of effective GNNs with improved generalization.
Abstract
Graph Neural Networks (GNNs) are deep-learning architectures designed for graph-type data, where understanding relationships among individual observations is crucial. However, achieving promising GNN performance, especially on unseen data, requires comprehensive hyperparameter tuning and meticulous training. Unfortunately, these processes come with high computational costs and significant human effort. Additionally, conventional searching algorithms such as grid search may result in overfitting on validation data, diminishing generalization accuracy. To tackle these challenges, we propose a graph conditional latent diffusion framework (GNN-Diff) to generate high-performing GNNs directly by learning from checkpoints saved during a light-tuning coarse search. Our method: (1) unleashes GNN training from heavy tuning and complex search space design; (2) produces GNN parameters that outperform those obtained through comprehensive grid search; and (3) establishes higher-quality generation for GNNs compared to diffusion frameworks designed for general neural networks.
