The Machine Learning Landscape of Top Taggers
G. Kasieczka, T. Plehn, A. Butter, K. Cranmer, D. Debnath, B. M. Dillon, M. Fairbairn, D. A. Faroughy, W. Fedorko, C. Gay, L. Gouskos, J. F. Kamenik, P. T. Komiske, S. Leiss, A. Lister, S. Macaluso, E. M. Metodiev, L. Moore, B. Nachman, K. Nordstrom, J. Pearkes, H. Qu, Y. Rath, M. Rieger, D. Shih, J. M. Thompson, S. Varma
TL;DR
The paper benchmarks a broad spectrum of deep learning taggers for identifying boosted hadronically decaying top quarks using low-level calorimeter input. It categorizes taggers into image-based, 4-vector-based, and theory-inspired approaches, showing broadly similar performance with ROC-AUC around 0.98 and strongest results from ParticleNet and ResNeXt. It also investigates ensembles and a meta-tagger (GoaT), revealing incremental gains and underscoring remaining challenges in calibration, uncertainty treatment, IR-safety, and detector effects. Overall, it demonstrates that deep networks operating on raw jet information provide competitive, versatile tools for LHC jet tagging, while stressing practical considerations for real-world deployment and further robustness studies.
Abstract
Based on the established task of identifying boosted, hadronically decaying top quarks, we compare a wide range of modern machine learning approaches. Unlike most established methods they rely on low-level input, for instance calorimeter output. While their network architectures are vastly different, their performance is comparatively similar. In general, we find that these new approaches are extremely powerful and great fun.
