Transformers for Charged Particle Track Reconstruction in High Energy Physics
Samuel Van Stroud, Philippa Duckett, Max Hart, Nikita Pond, Sébastien Rettie, Gabriel Facini, Tim Scanlon
TL;DR
The paper tackles the HL-LHC track-reconstruction bottleneck by introducing a two-stage Transformer-based pipeline: a windowed-hit filtering stage to dramatically reduce hit multiplicity, and a MaskFormer-inspired track reconstruction stage that jointly assigns hits and regresses track parameters. Evaluated on TrackML, the approach achieves a state-of-the-art $\approx$97% tracking efficiency with a low fake rate around $0.6\%$ and latency near $100\,$ms per event on a standard GPU, demonstrating both high accuracy and real-time feasibility. The work highlights two innovations—phi-local, windowed self-attention for linear-scaling hit filtering and an end-to-end learnable MaskFormer-style tracker—that enable flexible deployment from trigger-level to offline reconstruction. While results on TrackML are strong, the paper discusses the need for validation under realistic detector conditions, integration with existing frameworks, and possible extensions to include strip-layer data and uncertainty estimates, aiming for broader applicability in high-energy physics reconstruction. The findings suggest a promising direction toward unified, scalable learned reconstruction methods that can adapt to diverse detectors and operational requirements.
Abstract
Reconstructing charged particle tracks is a fundamental task in modern collider experiments. The unprecedented particle multiplicities expected at the High-Luminosity Large Hadron Collider (HL-LHC) pose significant challenges for track reconstruction, where traditional algorithms become computationally infeasible. To address this challenge, we present a novel learned approach to track reconstruction that adapts recent advances in computer vision and object detection. Our architecture combines a Transformer hit filtering network with a MaskFormer reconstruction model that jointly optimises hit assignments and the estimation of the charged particles' properties. Evaluated on the TrackML dataset, our best performing model achieves state-of-the-art tracking performance with 97% efficiency for a fake rate of 0.6%, and inference times of 100ms. Our tunable approach enables specialisation for specific applications like triggering systems, while its underlying principles can be extended to other reconstruction challenges in high energy physics. This work demonstrates the potential of modern deep learning architectures to address emerging computational challenges in particle physics while maintaining the precision required for groundbreaking physics analysis.
