Guided Quantum Compression for High Dimensional Data Classification
Vasilis Belis, Patrick Odagiu, Michele Grossi, Florentin Reiter, Günther Dissertori, Sofia Vallecorsa
TL;DR
The paper addresses the challenge of applying quantum machine learning to high-dimensional LHC data by introducing Guided Quantum Compression (GQC), a hybrid architecture that simultaneously learns a low-dimensional latent representation and a quantum classifier. By coupling an auto-encoder with a variational quantum circuit and training them with a joint loss, GQC preserves discriminative structure that separate preprocessing can destroy. On simulated $t\bar{t}H(b\bar{b})$ data, GQC outperforms the conventional 2Step approach and is competitive with classical baselines, with markedly better latent-space separability when using limited feature sets. This work broadens the practical applicability of QML to realistic physics datasets and provides public data and code to foster further development.
Abstract
Quantum machine learning provides a fundamentally different approach to analyzing data. However, many interesting datasets are too complex for currently available quantum computers. Present quantum machine learning applications usually diminish this complexity by reducing the dimensionality of the data, e.g., via auto-encoders, before passing it through the quantum models. Here, we design a classical-quantum paradigm that unifies the dimensionality reduction task with a quantum classification model into a single architecture: the guided quantum compression model. We exemplify how this architecture outperforms conventional quantum machine learning approaches on a challenging binary classification problem: identifying the Higgs boson in proton-proton collisions at the LHC. Furthermore, the guided quantum compression model shows better performance compared to the deep learning benchmark when using solely the kinematic variables in our dataset.
