Towards Understanding and Avoiding Limitations of Convolutions on Graphs
Andreas Roth
TL;DR
This work investigates why graph neural networks struggle to scale with depth, identifying two core phenomena—Shared Component Amplification (SCA) and Component Dominance (CD)—that drive rank collapse and over-smoothing. By reframing graph convolutions in spectral terms and mapping MP updates to power iterations, the authors articulate how a single computational graph inherently amplifies the same spectral component across all feature channels, limiting expressivity. To counteract this, they introduce the Multi-Relational Split (MRS) framework and the MIMO Graph Convolution (MIMO-GC), plus a localized LMGC variant, enabling multiple spectral components to be amplified across distinct relations or edges, thereby avoiding SCA and improving injectivity and expressivity. They further connect CD to PageRank and propose a Personalized PageRank GNN (PPRGNN) to permit infinite-depth propagation without losing initial information. Complementary results show that a Sum of Kronecker Products (SKP) framework can robustly avoid SCA and facilitate optimization, with empirical validation across standard graph datasets. Collectively, the work provides a cohesive theoretical foundation for understanding MP dynamics and delivers principled architectures to mitigate rank collapse and over-smoothing in graph neural networks, with practical implications for more scalable, expressive GNNs.
Abstract
While message-passing neural networks (MPNNs) have shown promising results, their real-world impact remains limited. Although various limitations have been identified, their theoretical foundations remain poorly understood, leading to fragmented research efforts. In this thesis, we provide an in-depth theoretical analysis and identify several key properties limiting the performance of MPNNs. Building on these findings, we propose several frameworks that address these shortcomings. We identify two properties exhibited by many MPNNs: shared component amplification (SCA), where each message-passing iteration amplifies the same components across all feature channels, and component dominance (CD), where a single component gets increasingly amplified as more message-passing steps are applied. These properties lead to the observable phenomenon of rank collapse of node representations, which generalizes the established over-smoothing phenomenon. By generalizing and decomposing over-smoothing, we enable a deeper understanding of MPNNs, more targeted solutions, and more precise communication within the field. To avoid SCA, we show that utilizing multiple computational graphs or edge relations is necessary. Our multi-relational split (MRS) framework transforms any existing MPNN into one that leverages multiple edge relations. Additionally, we introduce the spectral graph convolution for multiple feature channels (MIMO-GC), which naturally uses multiple computational graphs. A localized variant, LMGC, approximates the MIMO-GC while inheriting its beneficial properties. To address CD, we demonstrate a close connection between MPNNs and the PageRank algorithm. Based on personalized PageRank, we propose a variant of MPNNs that allows for infinitely many message-passing iterations, while preserving initial node features. Collectively, these results deepen the theoretical understanding of MPNNs.
