Finetuning Foundation Models for Joint Analysis Optimization
Matthias Vigl, Nicole Hartman, Lukas Heinrich
TL;DR
The paper investigates applying foundation-model workflows to high-energy physics by treating reconstruction as a learnable backbone and analysis as a downstream head that can be finetuned end-to-end. Through a demonstrator based on a heavy resonance decaying to two Higgs bosons, it shows that finetuning pretrained backbones yields notable gains in background rejection and data efficiency compared with frozen or from-scratch baselines, and that domain adaptation from larger jet datasets further improves performance. The study explores three architectures and three training strategies, highlighting that end-to-end optimization can achieve substantial improvements while reducing required labeled data. These findings suggest a practical path toward integrating foundation-model style pretraining, finetuning, and cross-dataset transfer in HEP analyses, with implications for calibration and task design in future work.
Abstract
In this work we demonstrate that significant gains in performance and data efficiency can be achieved in High Energy Physics (HEP) by moving beyond the standard paradigm of sequential optimization or reconstruction and analysis components. We conceptually connect HEP reconstruction and analysis to modern machine learning workflows such as pretraining, finetuning, domain adaptation and high-dimensional embedding spaces and quantify the gains in the example usecase of searches of heavy resonances decaying via an intermediate di-Higgs system to four $b$-jets.
