Averting multi-qubit burst errors in surface code magic state factories
Jason D. Chadwick, Christopher Kang, Joshua Viszlai, Sophia Fuhui Lin, Frederic T. Chong
TL;DR
The paper tackles burst-error challenges from cosmic rays and TLS scrambling in superconducting quantum hardware, focusing on magic-state factories which dominate spacetime costs. It introduces a low-overhead software strategy: detect bursts via windowed stabilizer syndrome counts, offload defective regions, and remap the five-qubit magic-state factory to a new footprint, aided by a buffered pool of distilled $T$ states to cover detection latency. Compared to Code expansion and Distributed baselines, the remapping approach achieves orders-of-magnitude reductions in qubitcycle overhead under ideal detection (geometric means up to ~13.9×), with robustness to nonzero detection latency, though performance degrades when detection is slow, especially for TLS scrambling. This method reduces hardware burden for magic-state distillation and can generalize to other transient error sources, representing a practical software-level mitigation for near-term fault-tolerant quantum hardware.
Abstract
Fault-tolerant quantum computation relies on the assumption of time-invariant, sufficiently low physical error rates. However, current superconducting quantum computers suffer from frequent disruptive noise events, including cosmic ray impacts and shifting two-level system defects. Several methods have been proposed to mitigate these issues in software, but they add large overheads in terms of physical qubit count, as it is difficult to preserve logical information through burst error events. We focus on mitigating multi-qubit burst errors in magic state factories, which are expected to comprise up to 95% of the space cost of future quantum programs. Our key insight is that magic state factories do not need to preserve logical information over time; once we detect an increase in local physical error rates, we can simply turn off parts of the factory that are affected, re-map the factory to the new chip geometry, and continue operating. This is much more efficient than previous more general methods, and is resilient even under many simultaneous impact events. Using precise physical noise models, we show an efficient ray detection method and evaluate our strategy in different noise regimes. Compared to existing baselines, we find reductions in ray-induced overheads by several orders of magnitude, reducing total qubitcycle cost by geomean 6.5x to 13.9x depending on the noise model. This work reduces the burden on hardware by providing low-overhead software mitigation of these errors.
