When Clifford benchmarks are sufficient; estimating application performance with scalable proxy circuits
Seth Merkel, Timothy Proctor, Samuele Ferracin, Jordan Hines, Samantha Barron, Luke C. G. Govia, David McKay
TL;DR
The paper addresses benchmarking large quantum devices by showing that, under the Pauli twirling assumption, Clifford-based proxy circuits can tightly bound the performance of general circuits. It establishes that the diamond distance of any circuit is well approximated by the fidelity of a Cliffordized proxy, and that Clifford fidelities can be estimated robustly against SPAM using two approaches. The authors validate these ideas with simulations and IBM hardware, including a volumetric Cliffordization benchmark up to 65 qubits and comparisons with random circuit sampling. The work provides a practical, verifiable framework for benchmarking near-term quantum processors, linking tractable Clifford simulations to the performance on more complex circuits across scalable regimes.
Abstract
The goal of benchmarking is to determine how far the output of a noisy system is from its ideal behavior; this becomes exceedingly difficult for large quantum systems where classical simulations become intractable. A common approach is to turn to circuits comprised of elements of the Clifford group (e.g., CZ, CNOT, $π$ and $π/2$ gates), which probe quantum behavior but are nevertheless efficient to simulate classically. However, there is some concern that these circuits may overlook error sources that impact the larger Hilbert space. In this manuscript, we show that for a broad class of error models these concerns are unwarranted. In particular, we show that, for error models that admit noise tailoring by Pauli twirling, the diamond norm and fidelity of any generic circuit is well approximated by the fidelities of proxy circuits composed only of Clifford gates. We discuss methods for extracting the fidelities of these Clifford proxy circuits in a manner that is robust to errors in state preparation and measurement and demonstrate these methods in simulation and on IBM Quantum's fleet of deployed heron devices.
