Symbolic Regression on FPGAs for Fast Machine Learning Inference
Ho Fung Tsoi, Adrian Alan Pol, Vladimir Loncar, Ekaterina Govorkova, Miles Cranmer, Sridhara Dasu, Peter Elmer, Philip Harris, Isobel Ojalvo, Maurizio Pierini
TL;DR
This work demonstrates an end-to-end symbolic regression (SR) pipeline for fast FPGA-based inference in high-energy physics, built by extending hls4ml to support expressions generated by PySR. It shows that SR can produce interpretable algebraic expressions that approximate a neural network, and that selecting models on the Pareto front allows the trade-off between performance and resource use to be optimized directly. The approach is validated on LHC jet tagging. With function approximation using lookup tables (LUTs), resource use drops sharply and inference runs up to 13 times faster, down to 5 ns, while preserving more than 90% approximation accuracy. The method offers a practical, interpretable, and resource-efficient alternative to deep learning in latency-constrained settings and opens a path toward broader SR-on-FPGA deployment.
Abstract
The high-energy physics community is investigating the potential of deploying machine-learning-based solutions on Field-Programmable Gate Arrays (FPGAs) to enhance physics sensitivity while still meeting data processing time constraints. In this contribution, we introduce a novel end-to-end procedure that utilizes a machine learning technique called symbolic regression (SR), which searches the equation space to discover algebraic relations approximating a dataset. We use PySR (software that uncovers such expressions using an evolutionary algorithm) and extend the functionality of hls4ml (a package for machine learning inference on FPGAs) to support PySR-generated expressions for resource-constrained production environments. Deep learning models are often optimized for the top metric at a fixed network size, because the vast hyperparameter space prevents an extensive neural architecture search. Conversely, SR selects a set of models on the Pareto front, which allows for optimizing the performance-resource trade-off directly. By embedding symbolic forms, our implementation can dramatically reduce the computational resources needed to perform critical tasks. We validate our method on a physics benchmark: the multiclass classification of jets produced in simulated proton-proton collisions at the CERN Large Hadron Collider. We show that our approach can approximate a 3-layer neural network using an inference model that achieves up to a 13-fold decrease in execution time, down to 5 ns, while still preserving more than 90% approximation accuracy.
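The idea of selecting models on the Pareto front can be illustrated with a minimal Python sketch. It is not PySR's or hls4ml's API: the `Candidate` type, the `pareto_front` helper, and all expressions and numbers below are hypothetical, chosen only to show how candidates that are not dominated in both complexity (a stand-in for resource cost) and loss are retained.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    expression: str   # hypothetical symbolic expression
    complexity: int   # stand-in for resource cost, e.g. number of operators
    loss: float       # fit error on the dataset (made-up values)


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep candidates for which no simpler-or-equal candidate has lower-or-equal loss."""
    front = []
    best_loss = float("inf")
    # Walk from simplest to most complex; keep a candidate only if it
    # strictly improves on the best loss seen so far.
    for cand in sorted(candidates, key=lambda c: (c.complexity, c.loss)):
        if cand.loss < best_loss:
            front.append(cand)
            best_loss = cand.loss
    return front


# Illustrative candidates only; these are not results from the paper.
candidates = [
    Candidate("x0", 1, 0.50),
    Candidate("x0 + x1", 3, 0.30),
    Candidate("x0 * x1", 3, 0.35),
    Candidate("sin(x0) + x1", 4, 0.32),
    Candidate("sin(x0) + x1 * x2", 6, 0.10),
]

front = pareto_front(candidates)
for cand in front:
    print(f"{cand.complexity:>2}  {cand.loss:.2f}  {cand.expression}")
```

Each model on the front is a different operating point: a deployment with a tight resource or latency budget would pick a low-complexity entry, while one that prioritizes accuracy would pick a more complex one.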
