The role of data embedding in quantum autoencoders for improved anomaly detection
Jack Y. Araz, Michael Spannowsky
TL;DR
This work investigates how data embedding methods influence the performance of Quantum Autoencoders (QAEs) for anomaly detection. By comparing standard angle embedding with data reuploading, parallel embedding, and alternate embedding, and by employing strongly entangling variational layers, the study demonstrates that embedding choices can dramatically improve the representability of data and anomaly-detection accuracy across both 2D and high-dimensional datasets. While enhanced embeddings require more qubits and deeper circuits, the gains suggest embedding strategy is a critical lever for QAE-based anomaly detection, especially as quantum hardware scales toward fault-tolerant regimes. The findings underscore the practical importance of embedding design for robust quantum machine learning in anomaly detection tasks.
Abstract
The performance of Quantum Autoencoders (QAEs) in anomaly detection tasks is critically dependent on the choice of data embedding and ansatz design. This study explores the effects of three data embedding techniques, data re-uploading, parallel embedding, and alternate embedding, on the representability and effectiveness of QAEs in detecting anomalies. Our findings reveal that even with relatively simple variational circuits, enhanced data embedding strategies can substantially improve anomaly detection accuracy and the representability of underlying data across different datasets. Starting with toy examples featuring low-dimensional data, we visually demonstrate the effect of different embedding techniques on the representability of the model. We then extend our analysis to complex, higher-dimensional datasets, highlighting the significant impact of embedding methods on QAE performance.
