PairNorm: Tackling Oversmoothing in GNNs
Lingxiao Zhao, Leman Akoglu
TL;DR
The paper addresses oversmoothing in deep graph neural networks by introducing PairNorm, a parameter-free normalization layer inserted between layers to preserve distance between distant node representations while keeping nearby nodes similar. Grounded in a graph-regularized view of convolution, PairNorm fixes the total pairwise distance across layers (with a variant that normalizes rows individually) and remains broadly applicable across GNN architectures. Empirical results on benchmark datasets show that PairNorm slows performance decay with depth for SGC, GCN, and GAT, and is especially beneficial in settings where many nodes lack features (SSNC-MV), enabling deeper models to outperform shallower ones. The work also introduces metrics for oversmoothing and demonstrates that PairNorm captures the balance between within-cluster cohesion and cross-cluster separation, offering a practical and scalable tool for robust deep GNNs.
Abstract
The performance of graph neural nets (GNNs) is known to gradually decrease with increasing number of layers. This decay is partly attributed to oversmoothing, where repeated graph convolutions eventually make node embeddings indistinguishable. We take a closer look at two different interpretations, aiming to quantify oversmoothing. Our main contribution is PairNorm, a novel normalization layer that is based on a careful analysis of the graph convolution operator, which prevents all node embeddings from becoming too similar. What is more, PairNorm is fast, easy to implement without any change to network architecture nor any additional parameters, and is broadly applicable to any GNN. Experiments on real-world graphs demonstrate that PairNorm makes deeper GCN, GAT, and SGC models more robust against oversmoothing, and significantly boosts performance for a new problem setting that benefits from deeper GNNs. Code is available at https://github.com/LingxiaoShawn/PairNorm.
