Universal New Physics Latent Space
Anna Hallin, Gregor Kasieczka, Sabine Kraml, André Lessa, Louis Moureaux, Tore von Schwartz, David Shih
TL;DR
The paper targets the challenge of vast Beyond the Standard Model theory space by learning a universal, two‑dimensional latent space that maps SM and diverse BSM predictions while preserving inter‑model relationships. Using a fully connected encoder trained with a contrastive loss on event‑level observables (MET and jet kinematics) across multiple datasets, it reveals that models cluster by LHC phenomenology, with distances in latent space reflecting discriminability and key physical differences such as $\Delta m$ and mediator type. Across MSSM gluino, dark‑matter mediator, and Dark Machines anomaly datasets, the latent embeddings systematically separate models into phenomenology‑driven regions and highlight observables that drive discrimination, enabling reduced benchmark sets and identification of coverage gaps. The approach promises improved reinterpretation, model‑space coverage assessment, and targeted exploration for future LHC searches, with future work aimed at incorporating cross‑sections, richer feature sets, and latent‑space sampling to reconstruct physical observables. $2$‑D latent space visualizations and distance relationships provide a practical framework for navigating the BSM landscape at colliders.
Abstract
We develop a machine learning method for mapping data originating from both Standard Model processes and various theories beyond the Standard Model into a unified representation (latent) space while conserving information about the relationship between the underlying theories. We apply our method to three examples of new physics at the LHC of increasing complexity, showing that models can be clustered according to their LHC phenomenology: different models are mapped to distinct regions in latent space, while indistinguishable models are mapped to the same region. This opens interesting new avenues on several fronts, such as model discrimination, selection of representative benchmark scenarios, and identifying gaps in the coverage of model space.
