Optimal Transport Event Representation for Anomaly Detection
Aditya Bhargava, Tianji Cai, Benjamin Nachman
TL;DR
This work tackles resonant anomaly detection in collider data under a weak supervision framework and introduces a physics-guided intermediate representation based on optimal transport (OT). By linearizing the $2$-Wasserstein distance into the LinW$_2$ embedding and compressing it with PCA to a few informative components, the authors create OT-based features that augment standard high-level observables. In ultra-low signal regimes ($S/B$ around $0.5\%$), OT$_k$ features achieve significant significance improvements (SI $>25$) and outperform both full low-level phase-space models and pretrained foundation models, while requiring only a modest number of PCA modes. The approach remains robust across datasets (R&D1 and R&D2) and illustrates the value of physically grounded, intermediate representations as a bridge between engineered features and end-to-end ML, with code available at the provided repository.
Abstract
We introduce optimal transport (OT) as a physics-based intermediate event representation for weakly supervised anomaly detection. With only $0.5\%$ injection of resonant signals in the LHC Olympics benchmark datasets, the OT-augmented feature set achieves nearly twice the significance improvement of standard high-level observables, while end-to-end deep learning on low-level four-momenta struggles in the low-signal regime. The gains persist across signal types and classifiers, underscoring the value of structured representations in machine learning for anomaly detection.
