Log Gaussian Cox Process Background Modeling in High Energy Physics
Yuval Frid, Liron Barak, Pavani Jairam, Michael Kagan, Rachel Jordan Hyneman
TL;DR
This work introduces Log Gaussian Cox Processes (LGCP) as a flexible, non-parametric method for modeling smooth backgrounds in bump-hunt analyses at the LHC. By treating event counts as a non-homogeneous Poisson process with intensity lambda(x) = N_E * exp(Z(x)) where Z is a Gaussian process, LGCP provides posterior uncertainties and unbinned fitting, addressing limitations of analytic functions and Gaussian-process regression. The authors develop a two-stage MCMC inference scheme to optimize kernel hyperparameters and sample the latent log-intensity, and extend LGCP to joint background+signal fits with a Gaussian signal shape. Through toy studies and spurious-signal tests, LGCP is shown to perform well for background-only modeling and can recover injected signals up to ~5% of the total yield, offering a practical, uncertainty-quantified alternative for LHC analyses with sidebands. Overall, LGCP complements existing approaches by enabling flexible background modeling without strong shape assumptions while providing interpretable uncertainties for inference in high energy physics.
Abstract
Background modeling is one of the most critical components in high energy physics data analyses, and for smooth backgrounds it is often performed by fitting using an analytic functional form. In this paper a novel method based on Log Gaussian Cox Processes (LGCP) is introduced to model smooth backgrounds while making minimal assumptions on the underlying shape. In LGCP, samples are assumed to be drawn from a non-homogeneous Poisson process, with an intensity function drawn from a Gaussian process. Markov Chain Monte Carlo is used for optimizing the hyper parameters and drawing the final fit for the background estimate from the posterior. Synthetic experiments comparing background modeling from functional forms and the LGCP are used to compare the different methods.
