Loss function to optimise signal significance in particle physics
Jai Bardhan, Cyrin Neeraj, Subhadip Mitra, Tanumoy Mandal
TL;DR
This work introduces a surrogate loss to directly maximise collider signal significance, defined as $Z \approx N_s/\sqrt{N_b}$, by formulating a submodular set function $\Delta_Z$ and applying the Lovász extension to obtain a convex surrogate $\bar{\Delta}_Z$ suitable for gradient-based training. The authors prove submodularity of the $Z$-score-based loss and demonstrate, in a toy two-background classification with a linear model, that models trained with $\bar{\Delta}_Z$ achieve higher signal efficiency at comparable $Z$ values than those trained with binarised cross-entropy. The results indicate potential gains in collider search sensitivity, with the approach naturally incorporating process cross sections into training. Code for the loss is publicly available, enabling broader experimentation and extension to more realistic settings with deep models and multiple backgrounds.
Abstract
We construct a surrogate loss to directly optimise the significance metric used in particle physics. We evaluate our loss function for a simple event classification task using a linear model and show that it produces decision boundaries that change according to the cross sections of the processes involved. We find that the models trained with the new loss have higher signal efficiency for similar values of estimated signal significance compared to ones trained with a cross-entropy loss, showing promise to improve sensitivity of particle physics searches at colliders.
