How to pick the best anomaly detector?
Marie Hein, Gregor Kasieczka, Michael Krämer, Louis Moureaux, Alexander Mück, David Shih
TL;DR
The paper tackles the problem of selecting the most sensitive anomaly detector for model-agnostic LHC searches by introducing ARGOS, a fully data-driven metric with a solid theoretical basis that is monotonic with the standard SIC in the ideal background-template limit. Defined as $\text{ARGOS} = \frac{\epsilon_{\text{SR}}}{\sqrt{\epsilon_{\text{BT}}}} - \sqrt{\epsilon_{\text{BT}}}$, ARGOS leverages a background template to enable data-driven working-point optimization without relying on labeled signals. Through comprehensive experiments on LHCO data with three weakly supervised detectors (IAD, CWoLa Hunting, CATHODE) and three classifier families (NN, HGB, AdaBoost), ARGOS consistently outperforms BCE-based selection for hyperparameters, architectures, and epoch choices, and can even guide feature selection. The approach offers a practical, label-free tool for detector tuning in real data and holds potential for broader applicability beyond resonant anomaly detection, while acknowledging limitations when background templates are imperfect. Overall, ARGOS provides a principled, data-driven framework for selecting and tuning anomaly detectors in high-energy physics analyses.
Abstract
Anomaly detection has the potential to discover new physics in unexplored regions of the data. However, choosing the best anomaly detector for a given data set in a model-agnostic way is an important challenge which has hitherto largely been neglected. In this paper, we introduce the data-driven ARGOS metric, which has a sound theoretical foundation and is empirically shown to robustly select the most sensitive anomaly detection model given the data. Focusing on weakly-supervised, classifier-based anomaly detection methods, we show that the ARGOS metric outperforms other model selection metrics previously used in the literature, in particular the binary cross-entropy loss. We explore several realistic applications, including hyperparameter tuning as well as architecture and feature selection, and in all cases we demonstrate that ARGOS is robust to the noisy conditions of anomaly detection.
