A Graph Laplacian Eigenvector-based Pre-training Method for Graph Neural Networks
Howard Dai, Nyambura Njenga, Hiren Madhu, Siddharth Viswanath, Ryan Pellico, Ian Adelstein, Smita Krishnaswamy
TL;DR
This work addresses structure-based pre-training for graph neural networks, aiming to capture global graph structure without relying on excessively deep architectures. It introduces the Laplacian Eigenvector Learning Module (LELM), a pre-training framework where a graph-level MLP and diffusion-based node features enable a GNN to predict the $k$ lowest-frequency Laplacian eigenvectors. The training objective combines an energy loss and an eigenvector loss, with a QR-based orthogonality constraint to ensure a proper eigenbasis and correct ordering, while preserving sign and basis invariances. Empirically, LELM yields consistent improvements on molecular property prediction tasks and outperforms several alternative structure-based targets, underscoring its potential as a versatile component for graph foundation models.
Abstract
The development of self-supervised graph pre-training methods is a crucial ingredient in recent efforts to design robust graph foundation models (GFMs). Structure-based pre-training methods are under-explored yet crucial for downstream applications which rely on underlying graph structure. In addition, pre-training traditional message passing GNNs to capture global and regional structure is often challenging due to the risk of oversmoothing as network depth increases. We address these gaps by proposing the Laplacian Eigenvector Learning Module (LELM), a novel pre-training module for graph neural networks (GNNs) based on predicting the low-frequency eigenvectors of the graph Laplacian. Moreover, LELM introduces a novel architecture that overcomes oversmoothing, allowing the GNN model to learn long-range interdependencies. Empirically, we show that models pre-trained via our framework outperform baseline models on downstream molecular property prediction tasks.
