Quantum Embedding with Transformer for High-dimensional Data

Hao-Yuan Chen; Yen-Jui Chang; Shih-Wei Liao; Ching-Ray Chang

Quantum Embedding with Transformer for High-dimensional Data

Hao-Yuan Chen, Yen-Jui Chang, Shih-Wei Liao, Ching-Ray Chang

TL;DR

This work tackles high-dimensional binary classification by integrating Vision Transformer features with a trainable linear layer that parameterizes a quantum embedding feeding a quantum neural network. The hybrid pipeline employs a Pauli-feature-map-based quantum kernel and a Real Amplitude Ansatz, trained end-to-end with backpropagation on near-term devices. Empirical results on BirdCLEF-2021 show the transformer-based quantum embedding achieves higher F1 scores and far lower variability than CNN-based baselines, indicating both effectiveness and reliability. The study suggests scalability to larger quantum circuits and points to future exploration of quantum kernel methods for CV and NLP tasks in a hybrid quantum-classical framework.

Abstract

Quantum embedding with transformers is a novel and promising architecture for quantum machine learning to deliver exceptional capability on near-term devices or simulators. The research incorporated a vision transformer (ViT) to advance quantum significantly embedding ability and results for a single qubit classifier with around 3 percent in the median F1 score on the BirdCLEF-2021, a challenging high-dimensional dataset. The study showcases and analyzes empirical evidence that our transformer-based architecture is a highly versatile and practical approach to modern quantum machine learning problems.

Quantum Embedding with Transformer for High-dimensional Data

TL;DR

Abstract

Paper Structure (15 sections, 5 equations, 4 figures, 1 table, 1 algorithm)

This paper contains 15 sections, 5 equations, 4 figures, 1 table, 1 algorithm.

Introduction
Methods
Model Architecture
Vision Transformer
Trainable Quantum Embedding with Classical Linear Transformation
Mathematical Model
Quantum Feature Map and Ansatz
Results
Effectiveness on Binary Classification
Performance Reliability
Discussion
Extension to Larger Circuits
Technical Challenges
Outlooks
Conclusion

Figures (4)

Figure 1: Model architecture of the research on quantum embedding with transformers for high-dimensional visual datasets. The orange part stands for the classical state of data, and the blue part means a quantum state of information. The input data feeds from the left to the right to classify the input information.
Figure 2: Quantum neural network's architecture. Once the data processed by the transformer, the linear representation layer will transform the data and embed into the quantum circuit forming quantum state of information.
Figure 3: Comparative results for the BirdCLEF-2021 dataset over various embedding methods, including classical CNN method (at the far left), CNN-based quantum embedding (at the middle), the architecture proposed in this research, which is transformer-based quantum embedding (at the far right)
Figure 4: Comparative results for the standard deviation of F1 score for BirdCLEF-2021 dataset over various embedding methods or training method, including classical CNN method (at the top), CNN-based quantum embedding (at the middle), the architecture proposed in this research, which is transformer-based quantum embedding (at the button)

Quantum Embedding with Transformer for High-dimensional Data

TL;DR

Abstract

Quantum Embedding with Transformer for High-dimensional Data

Authors

TL;DR

Abstract

Table of Contents

Figures (4)