Sharp Bounds for Poly-GNNs and the Effect of Graph Noise
Luciano Vinas, Arash A. Amini
TL;DR
The paper studies semi-supervised node classification with poly-GNNs under CSBM, establishing a sharp, depth-invariant rate of class-separation in the final node representations. It develops a novel walk-based noise-decomposition framework and derives tight upper and lower bounds for the signal and noise components of the feature SNR, showing the leading rate scales as $ u_n^{1/2}$ regardless of depth $k$ (for large graphs). The analysis reveals dominant walk types that govern noise, explains how graph noise can overwhelm aggregation benefits, and clarifies even/odd-depth effects on feature propagation. The results imply that graph aggregation improves performance over feature-only baselines at rate $ u_n^{1/2}$, while warning against unbounded depth due to oversmoothing being a scale effect, not a SNR gain.
Abstract
We investigate the classification performance of graph neural networks with graph-polynomial features, poly-GNNs, on the problem of semi-supervised node classification. We analyze poly-GNNs under a general contextual stochastic block model (CSBM) by providing a sharp characterization of the rate of separation between classes in their output node representations. A question of interest is whether this rate depends on the depth of the network $k$, i.e., whether deeper networks can achieve a faster separation? We provide a negative answer to this question: for a sufficiently large graph, a depth $k > 1$ poly-GNN exhibits the same rate of separation as a depth $k=1$ counterpart. Our analysis highlights and quantifies the impact of ``graph noise'' in deep GNNs and shows how noise in the graph structure can dominate other sources of signal in the graph, negating any benefit further aggregation provides. Our analysis also reveals subtle differences between even and odd-layered GNNs in how the feature noise propagates.
