Experimental robustness benchmarking of quantum neural networks on a superconducting quantum processor
Hai-Feng Zhang, Zhao-Yun Chen, Peng Wang, Liang-Liang Guo, Tian-Le Wang, Xiao-Yan Yang, Ren-Ze Zhao, Ze-An Zhao, Sheng Zhang, Lei Du, Hao-Ran Tao, Zhi-Long Jia, Wei-Cheng Kong, Huan-Yu Liu, Athanasios V. Vasilakos, Yang Yang, Yu-Chun Wu, Ji Guan, Peng Duan, Guo-Ping Guo
TL;DR
This work provides the first experimental robustness benchmark for a $20$‑qubit QNN on a superconducting quantum processor. It introduces Mask‑FGSM, a hardware‑aware, gradient‑sparsity–driven attack, and a fidelity‑based framework to quantify robustness bounds via $R_{\rm LB}$ and $R_{\rm UB}$, with an empirical gap of about $3\times 10^{-3}$ validating the bound tightness. Adversarial training markedly improves robustness by regularizing input gradients, and QNNs on hardware show stronger adversarial robustness than classical FNNs, a phenomenon attributed to gradient attenuation from intrinsic quantum noise. The study also demonstrates a practical, scalable pipeline for robustness benchmarking on NISQ devices and details the calibration, control, and readout mitigation required to realize reliable QNN experiments. Overall, this work lays a solid foundation for secure, reliable quantum machine learning on near‑term quantum hardware.
Abstract
Quantum machine learning (QML) models, like their classical counterparts, are vulnerable to adversarial attacks, hindering their secure deployment. Here, we report the first systematic experimental robustness benchmark for 20-qubit quantum neural network (QNN) classifiers executed on a superconducting processor. Our benchmarking framework features an efficient adversarial attack algorithm designed for QNNs, enabling quantitative characterization of adversarial robustness and robustness bounds. From our analysis, we verify that adversarial training reduces sensitivity to targeted perturbations by regularizing input gradients, significantly enhancing QNN's robustness. Additionally, our analysis reveals that QNNs exhibit superior adversarial robustness compared to classical neural networks, an advantage attributed to inherent quantum noise. Furthermore, the empirical upper bound extracted from our attack experiments shows a minimal deviation ($3 \times 10^{-3}$) from the theoretical lower bound, providing strong experimental confirmation of the attack's effectiveness and the tightness of fidelity-based robustness bounds. This work establishes a critical experimental framework for assessing and improving quantum adversarial robustness, paving the way for secure and reliable QML applications.
