Impact of unreliable devices on stability of quantum computations
Samudra Dasgupta, Travis S. Humble
TL;DR
This work tackles reliability and stability of quantum computations on NISQ devices under non-stationary noise by linking distributional changes in noise to output stability via the Hellinger distance $H$ and deriving a bound on stability $s_{max}$. It validates the framework with both synthetic depolarizing-noise models and real IBM Washington data using Monte Carlo methods to estimate joint noise distributions and their impact on a 5-qubit Bernstein-Vazirani circuit. The results show substantial non-stationarity in noise (reliability metric ranging from $41\%$ to $92\%$) and a conservative bound $s_{max$}$ that remains below observed stability oscillations, illustrating the device’s unreliability for reproducible mean outcomes in the BV benchmark. The methodology is general, modular, and applicable to larger-scale quantum systems, offering a principled approach to quantify reliability and guide calibration and error-mitigation strategies in the fault-tolerant transition.
Abstract
Noisy intermediate-scale quantum (NISQ) devices are valuable platforms for testing the tenets of quantum computing, but these devices are susceptible to errors arising from de-coherence, leakage, cross-talk and other sources of noise. This raises concerns regarding the stability of results when using NISQ devices since strategies for mitigating errors generally require well-characterized and stationary error models. Here, we quantify the reliability of NISQ devices by assessing the necessary conditions for generating stable results within a given tolerance. We use similarity metrics derived from device characterization data to derive and validate bounds on the stability of a 5-qubit implementation of the Bernstein-Vazirani algorithm. Simulation experiments conducted with noise data from IBM Washington, spanning January 2022 to April 2023, revealed that the reliability metric fluctuated between 41% and 92%. This variation significantly surpasses the maximum allowable threshold of 2.2% needed for stable outcomes. Consequently, the device proved unreliable for consistently reproducing the statistical mean in the context of the Bernstein-Vazirani circuit.
