Loss function to optimise signal significance in particle physics

Jai Bardhan; Cyrin Neeraj; Subhadip Mitra; Tanumoy Mandal

Loss function to optimise signal significance in particle physics

Jai Bardhan, Cyrin Neeraj, Subhadip Mitra, Tanumoy Mandal

TL;DR

This work introduces a surrogate loss to directly maximise collider signal significance, defined as $Z \approx N_s/\sqrt{N_b}$, by formulating a submodular set function $\Delta_Z$ and applying the Lovász extension to obtain a convex surrogate $\bar{\Delta}_Z$ suitable for gradient-based training. The authors prove submodularity of the $Z$-score-based loss and demonstrate, in a toy two-background classification with a linear model, that models trained with $\bar{\Delta}_Z$ achieve higher signal efficiency at comparable $Z$ values than those trained with binarised cross-entropy. The results indicate potential gains in collider search sensitivity, with the approach naturally incorporating process cross sections into training. Code for the loss is publicly available, enabling broader experimentation and extension to more realistic settings with deep models and multiple backgrounds.

Abstract

We construct a surrogate loss to directly optimise the significance metric used in particle physics. We evaluate our loss function for a simple event classification task using a linear model and show that it produces decision boundaries that change according to the cross sections of the processes involved. We find that the models trained with the new loss have higher signal efficiency for similar values of estimated signal significance compared to ones trained with a cross-entropy loss, showing promise to improve sensitivity of particle physics searches at colliders.

Loss function to optimise signal significance in particle physics

TL;DR

This work introduces a surrogate loss to directly maximise collider signal significance, defined as

, by formulating a submodular set function

and applying the Lovász extension to obtain a convex surrogate

suitable for gradient-based training. The authors prove submodularity of the

-score-based loss and demonstrate, in a toy two-background classification with a linear model, that models trained with

achieve higher signal efficiency at comparable

values than those trained with binarised cross-entropy. The results indicate potential gains in collider search sensitivity, with the approach naturally incorporating process cross sections into training. Code for the loss is publicly available, enabling broader experimentation and extension to more realistic settings with deep models and multiple backgrounds.

Loss function to optimise signal significance in particle physics

TL;DR

Abstract

Loss function to optimise signal significance in particle physics

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)