Quantum Vision Transformers for Quark-Gluon Classification
Marçal Comajoan Cara, Gopal Ramesh Dahale, Zhongtian Dong, Roy T. Forestano, Sergei Gleyzer, Daniel Justice, Kyoungchul Kong, Tom Magorsch, Konstantin T. Matchev, Katia Matcheva, Eyup B. Unlu
TL;DR
The paper tackles quark–gluon jet classification under HL-LHC-scale data constraints by proposing a quantum-classical hybrid Vision Transformer (QViT) that embeds variational quantum circuits into both the attention and MLP components. The approach is evaluated on CMS Open Data jet images, showing that the QViT achieves nearly parity with a classical Vision Transformer having a similar parameter count, albeit with a small ~2% AUC gap likely due to optimization and expressivity limitations of simulated VQCs. Key contributions include a concrete QViT design with four 4-qubit VQCs replacing linear projections in MHA and MLP, and a demonstration that quantum-inspired components can match classical performance on a realistic HEP task. The work provides a practical path toward quantum-assisted ML for high-energy physics, with plans to test on real quantum hardware, explore data augmentation and data re-uploading, and extend hyperparameter search to seek potential quantum advantages.
Abstract
We introduce a hybrid quantum-classical vision transformer architecture, notable for its integration of variational quantum circuits within both the attention mechanism and the multi-layer perceptrons. The research addresses the critical challenge of computational efficiency and resource constraints in analyzing data from the upcoming High Luminosity Large Hadron Collider, presenting the architecture as a potential solution. In particular, we evaluate our method by applying the model to multi-detector jet images from CMS Open Data. The goal is to distinguish quark-initiated from gluon-initiated jets. We successfully train the quantum model and evaluate it via numerical simulations. Using this approach, we achieve classification performance almost on par with the one obtained with the completely classical architecture, considering a similar number of parameters.
