COMBA: Cross Batch Aggregation for Learning Large Graphs with Context Gating State Space Models
Jiajun Shen, Yufei Jin, Yi He, xingquan Zhu
TL;DR
COMBA tackles the challenge of learning long-range dependencies on large homogeneous graphs by embedding graph structure into a batched, hop-aware state-space framework. It integrates three components—hop-aware local context, cross-batch aggregation, and context gating—to enable scalable, linear-time–like learning while preserving global information. The authors provide a theoretical guarantee that cross-batch aggregation reduces approximation error relative to isolated batch training, and empirically show superior accuracy and robustness across six benchmark graphs, with favorable scalability. This approach offers a practical pathway to apply state-space models to graphs at scale, potentially informing future designs for efficient long-range graph modeling in real-world networks.
Abstract
State space models (SSMs) have recently emerged for modeling long-range dependency in sequence data, with much simplified computational costs than modern alternatives, such as transformers. Advancing SMMs to graph structured data, especially for large graphs, is a significant challenge because SSMs are sequence models and the shear graph volumes make it very expensive to convert graphs as sequences for effective learning. In this paper, we propose COMBA to tackle large graph learning using state space models, with two key innovations: graph context gating and cross batch aggregation. Graph context refers to different hops of neighborhood for each node, and graph context gating allows COMBA to use such context to learn best control of neighbor aggregation. For each graph context, COMBA samples nodes as batches, and train a graph neural network (GNN), with information being aggregated cross batches, allowing COMBA to scale to large graphs. Our theoretical study asserts that cross-batch aggregation guarantees lower error than training GNN without aggregation. Experiments on benchmark networks demonstrate significant performance gains compared to baseline approaches. Code and benchmark datasets will be released for public access.
