Graph Neural Aggregation-diffusion with Metastability
Kaiyuan Cui, Xinyan Wang, Zicheng Zhang, Weichen Zhao
TL;DR
This paper introduces GRADE, a Graph Neural Network framework built on aggregation-diffusion equations that balance nonlinear diffusion with interaction-driven aggregation to produce metastable, clustered node representations. By embedding the aggregation-diffusion dynamics into a Neural ODE, GRADE provides a flexible, general approach that subsumes diffusion-only GNNs and connects to classical GNNs. The key contribution is showing that metastability can mitigate over-smoothing, evidenced by sustained Dirichlet energy and competitive accuracy across both homophilic and heterophilic graphs. The work offers a principled, physics-inspired viewpoint on graph representation learning with practical benefits for robust, expressive GNNs.
Abstract
Continuous graph neural models based on differential equations have expanded the architecture of graph neural networks (GNNs). Due to the connection between graph diffusion and message passing, diffusion-based models have been widely studied. However, diffusion naturally drives the system towards an equilibrium state, leading to issues like over-smoothing. To this end, we propose GRADE inspired by graph aggregation-diffusion equations, which includes the delicate balance between nonlinear diffusion and aggregation induced by interaction potentials. The node representations obtained through aggregation-diffusion equations exhibit metastability, indicating that features can aggregate into multiple clusters. In addition, the dynamics within these clusters can persist for long time periods, offering the potential to alleviate over-smoothing effects. This nonlinear diffusion in our model generalizes existing diffusion-based models and establishes a connection with classical GNNs. We prove that GRADE achieves competitive performance across various benchmarks and alleviates the over-smoothing issue in GNNs evidenced by the enhanced Dirichlet energy.
