Scaling Graph Neural Networks for Particle Track Reconstruction
Alok Tripathy, Alina Lazar, Xiangyang Ju, Paolo Calafiura, Katherine Yelick, Aydin Buluc
TL;DR
This work tackles memory and generalization challenges in graph neural network–based particle track reconstruction by enabling minibatch training on vertex-subset subgraphs via ShaDow sampling and introducing matrix-based acceleration. It combines an Interaction Network GNN with subgraph minibatching and optimized distributed execution to scale to large event graphs without sacrificing accuracy. The approach yields higher precision and recall than the prior full-graph training regime and achieves 1.3×–2× speedups over PyG baselines, with notable gains in GPU utilization and reduced all-reduce latency. Collectively, these advances make scalable, detector-agnostic GNN-based track reconstruction more practical for high-energy physics workloads.
Abstract
Particle track reconstruction is an important problem in high-energy physics (HEP), necessary to study properties of subatomic particles. Traditional track reconstruction algorithms scale poorly with the number of particles within the accelerator. The Exa.TrkX project, to alleviate this computational burden, introduces a pipeline that reduces particle track reconstruction to edge classification on a graph, and uses graph neural networks (GNNs) to produce particle tracks. However, this GNN-based approach is memory-prohibitive and skips graphs that would exceed GPU memory. We introduce improvements to the Exa.TrkX pipeline to train on samples of input particle graphs, and show that these improvements generalize to higher precision and recall. In addition, we adapt performance optimizations, introduced for GNN training, to fit our augmented Exa.TrkX pipeline. These optimizations provide a $2\times$ speedup over our baseline implementation in PyTorch Geometric.
