Application of Structured State Space Models to High energy physics with locality-sensitive hashing
Cheng Jiang, Sitian Qian
TL;DR
This work addresses the challenge of processing HL-LHC-scale data, characterized by large point clouds and long sequences, by applying structured state-space models (SSMs) augmented with locality-sensitive hashing (LSH). It investigates pure SSMs and hybrid Transformer-Mamba architectures (Mamba) with OR & AND LSH to achieve near-linear computational complexity while preserving physics performance. The results demonstrate substantial FLOP reductions (over 10×) and superior or comparable tracking and pileup metrics, with notable gains in recall and throughput, signaling a practical path for HL-LHC data analysis. Overall, the approach provides a viable, efficient alternative to full transformer backbones for HEP tasks that exhibit strong local inductive bias, enabling faster inference without sacrificing key physics outcomes.
Abstract
Modern high-energy physics (HEP) experiments are increasingly challenged by the vast size and complexity of their datasets, particularly regarding large-scale point cloud processing and long sequences. In this study, to address these challenges, we explore the application of structured state space models (SSMs), proposing one of the first trials to integrate local-sensitive hashing into either a hybrid or pure Mamba Model. Our results demonstrate that pure SSMs could serve as powerful backbones for HEP problems involving tasks for long sequence data with local inductive bias. By integrating locality-sensitive hashing into Mamba blocks, we achieve significant improvements over traditional backbones in key HEP tasks, surpassing them in inference speed and physics metrics while reducing computational overhead. In key tests, our approach demonstrated promising results, presenting a viable alternative to traditional transformer backbones by significantly reducing FLOPS while maintaining robust performance.
