Trustworthy Quantum Machine Learning: A Roadmap for Reliability, Robustness, and Security in the NISQ Era
Ferhat Ozgur Catak, Jungwon Seo, Umit Cali
TL;DR
This paper proposes a unified roadmap for Trustworthy Quantum Machine Learning (TQML) in the NISQ era, centering reliability on three pillars: uncertainty quantification, adversarial robustness, and privacy preservation. It develops a quantum-information-theoretic framework for trust metrics, outlines quantum-specific attack surfaces and defenses, and demonstrates feasibility on current devices through a unified trust-assessment pipeline. The work combines theoretical foundations with extensive experiments on uncertainty quantification, adversarial attacks including FGSM and PGD, and federated quantum learning with differential privacy to reveal practical trade-offs between reliability, robustness, and privacy. The findings show that uncertainty metrics effectively predict misclassifications, classical gradient-based attacks pose significant threats to quantum classifiers (with quantum-state perturbations being less effective), and privacy-preserving federated learning can achieve meaningful privacy gains with manageable accuracy costs. Overall, the roadmap provides concrete methods, metrics, and experimental validation to advance trustworthy quantum AI during the NISQ era and beyond, urging standardized benchmarks and certification pathways for responsible quantum deployment.
Abstract
Quantum machine learning (QML) is a promising paradigm for tackling computational problems that challenge classical AI. Yet, the inherent probabilistic behavior of quantum mechanics, device noise in NISQ hardware, and hybrid quantum-classical execution pipelines introduce new risks that prevent reliable deployment of QML in real-world, safety-critical settings. This research offers a broad roadmap for Trustworthy Quantum Machine Learning (TQML), integrating three foundational pillars of reliability: (i) uncertainty quantification for calibrated and risk-aware decision making, (ii) adversarial robustness against classical and quantum-native threat models, and (iii) privacy preservation in distributed and delegated quantum learning scenarios. We formalize quantum-specific trust metrics grounded in quantum information theory, including a variance-based decomposition of predictive uncertainty, trace-distance-bounded robustness, and differential privacy for hybrid learning channels. To demonstrate feasibility on current NISQ devices, we validate a unified trust assessment pipeline on parameterized quantum classifiers, uncovering correlations between uncertainty and prediction risk, an asymmetry in attack vulnerability between classical and quantum state perturbations, and privacy-utility trade-offs driven by shot noise and quantum channel noise. This roadmap seeks to define trustworthiness as a first-class design objective for quantum AI.
