Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy Physics
Siqi Miao, Zhiyuan Lu, Mia Liu, Javier Duarte, Pan Li
TL;DR
The paper tackles real-time processing of massive scientific point clouds by introducing HEPT, a locality-sensitive hashing-based Efficient Point Transformer that embeds explicit local inductive bias in the attention kernel. It provides a rigorous error-computation analysis showing that Random Fourier Features under subquadratic budgets are outperformed by LSH, and that combining OR & AND LSH yields exponential error decay at near-linear computational cost. HEPT consequently achieves state-of-the-art accuracy and massive speedups (up to 203x) on two high-energy physics tasks (tracking and pileup mitigation) without requiring graph construction, highlighting its practical value for large-scale geometric deep learning in physics. The work also demonstrates solid scalability and robustness to hyperparameter choices, and offers a publicly available implementation for broader adoption in scientific domains.
Abstract
This study introduces a novel transformer model optimized for large-scale point cloud processing in scientific domains such as high-energy physics (HEP) and astrophysics. Addressing the limitations of graph neural networks and standard transformers, our model integrates local inductive bias and achieves near-linear complexity with hardware-friendly regular operations. One contribution of this work is the quantitative analysis of the error-complexity tradeoff of various sparsification techniques for building efficient transformers. Our findings highlight the superiority of using locality-sensitive hashing (LSH), especially OR & AND-construction LSH, in kernel approximation for large-scale point cloud data with local inductive bias. Based on this finding, we propose LSH-based Efficient Point Transformer (HEPT), which combines E$^2$LSH with OR & AND constructions and is built upon regular computations. HEPT demonstrates remarkable performance on two critical yet time-consuming HEP tasks, significantly outperforming existing GNNs and transformers in accuracy and computational speed, marking a significant advancement in geometric deep learning and large-scale scientific data processing. Our code is available at https://github.com/Graph-COM/HEPT.
