FIT-GNN: Faster Inference Time for GNNs that 'FIT' in Memory Using Coarsening

Shubhajit Roy; Hrriday Ruparel; Kishan Ved; Anirban Dasgupta

TL;DR

This work tackles the bottleneck of GNN inference on large graphs by using graph coarsening to split the input into subgraphs and augmenting them with Extra Nodes or Cluster Nodes to mitigate boundary information loss. The method, FIT-GNN, enables training and inference on subgraphs, yielding up to 100× faster single-node inference and substantial memory savings while preserving competitive accuracy across node- and graph-level tasks on diverse benchmarks. The authors provide a theoretical time/space complexity framework and validate the approach with extensive experiments on 13 real-world datasets, showing practical scalability where traditional full-graph inference is infeasible. Overall, FIT-GNN offers a scalable, memory-efficient pathway for deploying GNNs on large-scale graphs with minimal performance degradation.

Abstract

Scalability of Graph Neural Networks (GNNs) remains a significant challenge. To tackle this, methods like coarsening, condensation, and computation trees are used to train on a smaller graph, resulting in faster computation. Nonetheless, prior research has not adequately addressed the computational costs during the inference phase. This paper presents a novel approach to improve the scalability of GNNs by reducing computational burden during the inference phase using graph coarsening. We demonstrate two different methods -- Extra Nodes and Cluster Nodes. Our study extends the application of graph coarsening for graph-level tasks, including graph classification and graph regression. We conduct extensive experiments on multiple benchmark datasets to evaluate the performance of our approach. Our results show that the proposed method achieves orders of magnitude improvements in single-node inference time compared to traditional approaches. Furthermore, it significantly reduces memory consumption for node and graph classification and regression tasks, enabling efficient training and inference on low-resource devices where conventional methods are impractical. Notably, these computational advantages are achieved while maintaining competitive performance relative to baseline models.
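As a rough illustration of the subgraph idea summarized above, the sketch below splits a graph into one subgraph per cluster and attaches out-of-cluster neighbours as extra nodes. It is a minimal sketch under stated assumptions, not the authors' implementation: the partition `cluster_of` is assumed to come from some coarsening algorithm that is not shown, "extra nodes" is read here as the 1-hop neighbours that fall in other clusters, and the function name `build_subgraphs` and the returned dictionary layout are hypothetical.

```python
from collections import defaultdict


def build_subgraphs(edges, cluster_of):
    """Split an undirected graph into one subgraph per cluster.

    edges      : iterable of (u, v) node pairs
    cluster_of : dict mapping node -> cluster id (assumed to be given
                 by a coarsening step that is not implemented here)

    Each subgraph keeps the cluster's own ("core") nodes plus "extra"
    nodes, taken here to be 1-hop neighbours in other clusters. This is
    an assumed reading of Extra Nodes, not a confirmed definition.
    """
    core = defaultdict(set)
    for node, cluster in cluster_of.items():
        core[cluster].add(node)

    extra = defaultdict(set)
    sub_edges = defaultdict(set)
    for u, v in edges:
        cu, cv = cluster_of[u], cluster_of[v]
        edge = (min(u, v), max(u, v))  # canonical form for undirected edges
        sub_edges[cu].add(edge)
        if cu != cv:
            # A boundary edge is copied into both subgraphs, and each side
            # gains the opposite endpoint as an extra node.
            sub_edges[cv].add(edge)
            extra[cu].add(v)
            extra[cv].add(u)

    return {
        c: {"core": core[c], "extra": extra[c], "edges": sub_edges[c]}
        for c in list(core)
    }


# Toy path graph 0-1-2-3-4-5 split into two clusters of three nodes.
toy_edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)]
toy_partition = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}
toy_subgraphs = build_subgraphs(toy_edges, toy_partition)
```

Under this reading, predicting for a single node only requires loading the one subgraph that contains it as a core node, which is where the inference-time and memory savings would come from; the extra nodes retain the boundary edge (2, 3) that a plain partition would cut.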
