Quantum Embedding with Transformer for High-dimensional Data
Hao-Yuan Chen, Yen-Jui Chang, Shih-Wei Liao, Ching-Ray Chang
TL;DR
This work tackles high-dimensional binary classification by integrating Vision Transformer features with a trainable linear layer that parameterizes a quantum embedding feeding a quantum neural network. The hybrid pipeline employs a Pauli-feature-map-based quantum kernel and a Real Amplitude Ansatz, trained end-to-end with backpropagation on near-term devices. Empirical results on BirdCLEF-2021 show the transformer-based quantum embedding achieves higher F1 scores and far lower variability than CNN-based baselines, indicating both effectiveness and reliability. The study suggests scalability to larger quantum circuits and points to future exploration of quantum kernel methods for CV and NLP tasks in a hybrid quantum-classical framework.
Abstract
Quantum embedding with transformers is a novel and promising architecture for quantum machine learning to deliver exceptional capability on near-term devices or simulators. The research incorporated a vision transformer (ViT) to advance quantum significantly embedding ability and results for a single qubit classifier with around 3 percent in the median F1 score on the BirdCLEF-2021, a challenging high-dimensional dataset. The study showcases and analyzes empirical evidence that our transformer-based architecture is a highly versatile and practical approach to modern quantum machine learning problems.
