Optimised neural networks for online processing of ATLAS calorimeter data on FPGAs
Georges Aad, Raphael Bertrand, Lauri Laatu, Emmanuel Monnier, Arno Straessner, Nairit Sur, Johann C. Voigt
TL;DR
This work tackles energy reconstruction for ATLAS LAr calorimeter cells under HL-LHC pile-up by deploying FPGA-compatible neural networks. Through Bayesian hyperparameter optimisation, Dense, CNN, and combined Dense+RNN architectures achieve about $80~\mathrm{MeV}$ energy resolution, outperforming the current optimal filtering method and comparable RNNs while staying under hardware limits. The Dense architecture is extended with Deep Evidential Regression to provide per-event uncertainties via a Normal–Inverse–Gamma distribution, with epistemic uncertainty dominating and overall uncertainties consistent with prediction residuals. The results demonstrate feasible, low-latency FPGA implementations that improve energy scale accuracy and supply reliable uncertainty estimates for clustering and trigger decision-making in HL-LHC conditions.
Abstract
A study of neural network architectures for the reconstruction of the energy deposited in the cells of the ATLAS liquid-argon calorimeters under high pile-up conditions expected at the HL-LHC is presented. These networks are designed to run on the FPGA-based readout hardware of the calorimeters under strict size and latency constraints. Several architectures, including Dense, Recurrent (RNN), and Convolutional (CNN) neural networks, are optimised using a Bayesian procedure that balances energy resolution against network size. The optimised Dense, CNN, and combined Dense+RNN architectures achieve a transverse energy resolution of approximately 80 MeV, outperforming both the optimal filtering (OF) method currently in use and RNNs of similar complexity. A detailed comparison across the full dynamic range shows that Dense, CNN, and Dense+RNN accurately reproduce the energy scale, while OF and RNNs underestimate the energy. Deep Evidential Regression is implemented within the Dense architecture to address the need for reliable per-event energy uncertainties. This approach provides predictive uncertainty estimates with minimal increase in network size. The predicted uncertainty is found to be consistent, on average, with the difference between the true deposited energy and the predicted energy.
