Understanding the Effect of GCN Convolutions in Regression Tasks
Juntong Chen, Johannes Schmidt-Hieber, Claire Donnat, Olga Klopp
TL;DR
This work analyzes the statistical behavior of graph convolutional networks (GCNs) in regression tasks under a fixed-design setting, focusing on how the neighborhood depth $L$ and two convolution operators, the original GCN convolution $T$ and the GraphSAGE convolution $S$, affect estimation error. It develops a bias-variance decomposition for linear GCNs and introduces a novel walk-based variance analysis, showing that the variance can be expressed as a weighted sum over local walks and that topology can slow variance decay, contributing to over-smoothing in non-asymptotic regimes. The theoretical results identify an optimal depth $L$ that balances bias and variance and show how different local topologies (e.g., rooted trees versus cycles) influence variance decay; these results are validated by synthetic experiments. Real-data experiments on six diverse graphs corroborate the theory and offer practical guidelines for selecting the convolution type and depth to improve regression performance on graphs.
Abstract
Graph Convolutional Networks (GCNs) have become a pivotal method in machine learning for modeling functions over graphs. Despite their widespread success across various applications, their statistical properties (e.g., consistency, convergence rates) remain ill-characterized. To begin addressing this knowledge gap, we consider networks for which the graph structure implies that neighboring nodes exhibit similar signals, and we provide statistical theory for the impact of convolution operators. Focusing on estimators based solely on neighborhood aggregation, we examine how two common convolutions (the original GCN and GraphSAGE convolutions) affect the learning error as a function of the neighborhood topology and the number of convolutional layers. We explicitly characterize the bias-variance type trade-off incurred by GCNs as a function of the neighborhood size and identify specific graph topologies where convolution operators are less effective. Our theoretical findings are corroborated by synthetic experiments and provide a first step toward a deeper quantitative understanding of convolutional effects in GCNs, with the aim of offering rigorous guidelines for practitioners.
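As a rough illustration of how the variance of a pure neighborhood-aggregation estimator depends on the operator and the local topology, the sketch below computes the pointwise variance of $\hat f = P^L Y$ at a single node. Several ingredients are assumptions not stated in this excerpt: the operators are taken in their commonly used forms $T = D^{-1/2}(A+I)D^{-1/2}$ and $S = D^{-1}(A+I)$, where $D$ is the degree matrix of $A+I$; the noise is i.i.d. with variance $\sigma^2$, so that the variance at node $i$ equals $\sigma^2 \sum_j (P^L)_{ij}^2$; and the helper names (`cycle_adjacency`, `star_adjacency`, `node_variance`) are hypothetical.

```python
import numpy as np

def cycle_adjacency(n):
    """Adjacency matrix of a cycle on n nodes (every node has degree 2)."""
    A = np.zeros((n, n))
    for i in range(n):
        A[i, (i + 1) % n] = 1.0
        A[(i + 1) % n, i] = 1.0
    return A

def star_adjacency(m):
    """Adjacency matrix of a star: node 0 is the root, nodes 1..m are leaves."""
    A = np.zeros((m + 1, m + 1))
    A[0, 1:] = 1.0
    A[1:, 0] = 1.0
    return A

def gcn_operator(A):
    """Assumed form of T: D^{-1/2} (A + I) D^{-1/2}."""
    A_tilde = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_tilde.sum(axis=1))
    return d_inv_sqrt[:, None] * A_tilde * d_inv_sqrt[None, :]

def sage_operator(A):
    """Assumed form of S: D^{-1} (A + I), i.e. a row-normalized average."""
    A_tilde = A + np.eye(A.shape[0])
    return A_tilde / A_tilde.sum(axis=1)[:, None]

def node_variance(P, L, node=0, sigma2=1.0):
    """Variance of (P^L Y)[node] when Y has i.i.d. noise with variance sigma2."""
    PL = np.linalg.matrix_power(P, L)
    return sigma2 * float(np.sum(PL[node] ** 2))

if __name__ == "__main__":
    graphs = {"cycle (n=12)": cycle_adjacency(12), "star (3 leaves)": star_adjacency(3)}
    for name, A in graphs.items():
        T, S = gcn_operator(A), sage_operator(A)
        for L in range(1, 5):
            print(f"{name}, L={L}: "
                  f"var_T={node_variance(T, L):.4f}, var_S={node_variance(S, L):.4f}")
```

On a regular graph such as the cycle, the two assumed operators coincide, so any difference between $T$ and $S$ in this sketch comes from degree heterogeneity, as at the root of the star. The sketch covers only the variance term; the bias depends on the smoothness of the underlying signal and is not modeled here.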
