Revisiting Neighborhood Aggregation in Graph Neural Networks for Node Classification using Statistical Signal Processing
Mounir Ghogho
TL;DR
This work reconsiders neighborhood aggregation in graph neural networks for node classification under the edge-independent node-labels (EINL) assumption using a statistical signal processing perspective. It derives Bayes-optimal, single-layer classifiers under a homoscedastic Gaussian feature model, comparing weighted sum aggregation (WSA) and sum-then-concatenate aggregation (SCA), and showing that SCA can yield larger discriminative gains especially when graph structure meaningfully informs class separation. The paper extends the analysis to higher-order neighborhoods and to node-degree–dependent label correlations, providing analytical guidance on when and how to weight neighbor information and how normalization affects performance. Simulations corroborate the theoretical insights, highlighting that fixed GNN weights may be suboptimal when graph statistics vary with degree or class, and suggesting principled alternatives for robust, statistic-aware GNN design with practical parameter estimation considerations.
Abstract
We delve into the issue of node classification within graphs, specifically reevaluating the concept of neighborhood aggregation, which is a fundamental component in graph neural networks (GNNs). Our analysis reveals conceptual flaws within certain benchmark GNN models when operating under the assumption of edge-independent node labels, a condition commonly observed in benchmark graphs employed for node classification. Approaching neighborhood aggregation from a statistical signal processing perspective, our investigation provides novel insights which may be used to design more efficient GNN models.
