Single-GPU GNN Systems: Traps and Pitfalls
Yidong Gong, Arnab Tarafder, Saima Afrin, Pradeep Kumar
TL;DR
This work investigates pervasive pitfalls in single-GPU GNN systems, showing that missing end-to-end accuracy, backward-computation misconceptions, framework overhead, and memory inefficiencies collectively distort performance claims. By systematically evaluating over 20 systems, the authors reveal how kernel-level optimizations can be overcredited when fundamental correctness is neglected. They propose a structured set of recommendations and a reference single-GPU GNN system designed around clear requirements, symmetry-aware storage, and native, well-ordered kernels to address these pitfalls. The results demonstrate practical improvements, including reduced memory footprints and the ability to train very large graphs on a single GPU, underscoring the importance of end-to-end evaluation and careful system design for credible progress in GNN systems research.
Abstract
The current graph neural network (GNN) systems have established a clear trend of not showing training accuracy results, and directly or indirectly relying on smaller datasets for evaluations majorly. Our in-depth analysis shows that it leads to a chain of pitfalls in the system design and evaluation process, questioning the practicality of many of the proposed system optimizations, and affecting conclusions and lessons learned. We analyze many single-GPU systems and show the fundamental impact of these pitfalls. We further develop hypotheses, recommendations, and evaluation methodologies, and provide future directions. Finally, a new reference system is developed to establish a new line of optimizations rooted in solving the system-design pitfalls efficiently and practically. The proposed design can productively be integrated into prior works, thereby truly advancing the state-of-the-art.
