BitHEP -- The Limits of Low-Precision ML in HEP
Claudius Krause, Daohan Wang, Ramon Winterhalder
TL;DR
This paper evaluates BitNet, a low-precision architecture trained with quantization-aware training, on three core HEP tasks: quark-gluon tagging, SMEFT parameter estimation, and detector simulation. Using BitLinear layers with binary or ternary weights and 8-bit inputs, it finds competitive performance in quark-gluon classification, while regression and generative tasks degrade to a degree that depends on network size and on which layers are quantized. The results point to the importance of selective quantization, with attention-based architectures and larger networks tending to be more resilient, and suggest that quantization-aware training can combine efficiency gains with acceptable accuracy for HL-LHC-scale workloads. The findings motivate further work on heterogeneous, fully quantized pipelines and hardware-specific kernels for real-time, energy-efficient ML in high-energy physics.
Abstract
The increasing complexity of modern neural network architectures demands fast and memory-efficient implementations to mitigate computational bottlenecks. In this work, we evaluate the recently proposed BitNet architecture in HEP applications, assessing its performance in classification, regression, and generative modeling tasks. Specifically, we investigate its suitability for quark-gluon discrimination, SMEFT parameter estimation, and detector simulation, comparing its efficiency and accuracy to state-of-the-art methods. Our results show that while BitNet consistently performs competitively in classification tasks, its performance in regression and generation varies with the size and type of the network, highlighting key limitations and potential areas for improvement.
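To make the BitLinear idea mentioned in the TL;DR concrete, the following is a minimal sketch of a linear layer with ternary weights and 8-bit inputs. It assumes an absmean scale for the weights and an absmax scale for the inputs, as commonly described for BitNet-style layers. This section does not specify the exact scheme used in the paper, so the function names, the `eps` parameter, and the scaling choices are illustrative assumptions rather than the authors' implementation. It covers only the forward pass and omits the training-time gradient handling that quantization-aware training requires.

```python
def quantize_weights_ternary(weights, eps=1e-8):
    """Map a weight matrix (list of rows) to {-1, 0, +1}.

    Assumption: the scale is the mean absolute weight (absmean).
    """
    flat = [abs(w) for row in weights for w in row]
    scale = sum(flat) / len(flat) + eps
    # Round to the nearest integer, then clip to the ternary range.
    q = [[max(-1, min(1, round(w / scale))) for w in row] for row in weights]
    return q, scale


def quantize_inputs_int8(x, eps=1e-8):
    """Map an input vector to integers in [-127, 127].

    Assumption: the scale is set by the largest absolute input (absmax).
    """
    scale = max(abs(v) for v in x) / 127.0 + eps
    q = [max(-127, min(127, round(v / scale))) for v in x]
    return q, scale


def bitlinear_forward(x, weights):
    """Compute y = W x with quantized operands, then rescale to floats."""
    w_q, w_scale = quantize_weights_ternary(weights)
    x_q, x_scale = quantize_inputs_int8(x)
    # Integer accumulation: with ternary weights each term is an
    # addition, a subtraction, or a skip, so no multiplier is needed.
    acc = [sum(w * v for w, v in zip(row, x_q)) for row in w_q]
    return [a * w_scale * x_scale for a in acc]


# Hypothetical toy values, chosen only to exercise the functions.
weights = [[0.9, -0.05, -1.1], [0.02, 0.8, 0.0]]
x = [1.0, -0.5, 0.25]
w_q, w_scale = quantize_weights_ternary(weights)
x_q, x_scale = quantize_inputs_int8(x)
y = bitlinear_forward(x, weights)
```

In this toy case the small weights collapse to zero and the large ones to ±1, which shows why quantization error is coarse for small layers and why the choice of which layers to quantize can matter.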
