Recent quantum runtime (dis)advantages
J. Tuziemski, J. Pawłowski, P. Tarasiuk, Ł. Pawela, B. Gardas
TL;DR
The paper interrogates reported quantum runtime advantages on NISQ devices by enforcing end-to-end timing with overheads through the $TT_{\varepsilon}$ metric, arguing that previous analyses often misrepresent true runtimes. It re-evaluates three milestones—quantum annealing for approximate QUBO, gate-based restricted Simon’s problem, and the BF-DCQO hybrid—and finds no durable runtime advantage when proper timing and strong classical baselines are used. The authors advocate for careful reference selection, avoidance of cherry-picking, and consideration of hardware-specific overheads to avoid false claims of supremacy. Their conclusion is that credible, runtime-based quantum supremacy on current hardware remains elusive, requiring rigorous benchmarking and problem-class-appropriate comparisons.
Abstract
We (re)evaluate recent claims of quantum advantage in annealing- and gate-based algorithms, testing whether reported speedups survive rigorous end-to-end runtime definitions and comparison against strong classical baselines. Conventional analyses often omit substantial overhead (readout, transpilation, thermalization, etc.) yielding biased assessments. While excluding seemingly not important parts of the simulation may seem reasonable, on most current quantum hardware a clean separation between "pure compute" and "overhead" cannot be experimentally justified. This may distort "supremacy" results. In contrast, for most classical hardware total time $\approx$ compute $+$ a weakly varying constant leading to robust claims. We scrutinize two important milestones: (1) quantum annealing for approximate QUBO PRL 134, 160601 (2025) [https://journals.aps.org/prl/abstract/10.1103/PhysRevLett.134.160601], which uses a sensible time-to-$ε$ metric but proxies runtime by the annealing time (non-measurable); (2) a restricted Simon's problem PRX 15, 021082 (2025) [https://journals.aps.org/prx/abstract/10.1103/PhysRevX.15.021082] , whose advantageous scaling in oracle calls is undisputed; yet, as we demonstrate, estimated runtime of the quantum experiment is $\sim 100 \times$ slower than a tuned classical baseline. Finally, we show that recently claimed "runtime advantage" of the BF-DCQO hybrid algorithm (arXiv:2505.08663) does not withstand rigorous benchmarking. Therefore, we conclude that runtime-based supremacy remains elusive on NISQ hardware, and credible claims require a careful time accounting with a proper reference selections, and an adequate metric.
