Triggering Dark Showers with Conditional Dual Auto-Encoders
Luca Anzalone, Simranjit Singh Chhibra, Benedikt Maier, Nadezda Chernyavskaya, Maurizio Pierini
TL;DR
The paper tackles model-independent searches for new physics in collider data by reframing signal detection as anomaly detection on raw detector images. It introduces Conditional Dual Auto-Encoders (CoDAEs) and a categorical variant (CoDVAE) that leverage dual encoders and spatial conditioning to learn a compact latent space for robust anomaly scoring without requiring signal simulations during training. Evaluated on simulated CMS-like data for Hidden Valley scenarios (SUEP and SVJ), the approach achieves competitive AUROC and low FPR40, often surpassing traditional baselines and approaching supervised performance, while enabling fast inference suitable for high-level triggering. These results support deploying such fast, model-agnostic anomaly detectors in real-time trigger systems to enable generic discovery of unknown signals with reduced reliance on specific signal hypotheses.
Abstract
We present a family of conditional dual auto-encoders (CoDAEs) for generic and model-independent new physics searches at colliders. New physics signals, which arise from new types of particles and interactions, are considered in our study as anomalies causing deviations in data with respect to expected background events. In this work, we perform a normal-only anomaly detection, which employs only background samples, to search for manifestations of a dark version of strong force applying (variational) auto-encoders on raw detector images, which are large and highly sparse, without leveraging any physics-based pre-processing or strong assumption on the signals. The proposed CoDAE has a dual-encoder design, which is general and can learn an auxiliary yet compact latent space through spatial conditioning, showing a neat improvement over competitive physics-based baselines and related approaches, therefore also reducing the gap with fully supervised models. It is the first time an unsupervised model is shown to exhibit excellent discrimination against multiple dark shower models, illustrating the suitability of this method as an accurate, fast, model-independent algorithm to deploy, e.g., in the real-time event triggering systems of Large Hadron Collider experiments such as ATLAS and CMS.
