Run-length certificates in quantum learning: sample complexity and noise thresholds
Jeongho Bang
TL;DR
This work reframes quantum state learning under strict copy-by-copy interaction as a stopping-time problem, where learning completes via an online run-length certificate requiring $M_H$ consecutive successes. By decomposing sample complexity into a dimension-dependent search phase and a dimension-independent certification phase, the authors derive universal bounds on stopping times and establish a sharp noise threshold: when the label-flip probability $q$ satisfies $qM_H\gtrsim 1$, halting becomes exponentially expensive. The analysis reveals a fundamental information-theoretic bottleneck induced by the one-bit feedback channel, connects the observed $O(N^{-1})$ accuracy to quantum-limit constraints, and demonstrates, through simulations, the practical implications of classification noise on adaptive quantum learning. The results offer a principled framework for designing minimal-feedback protocols and stopping rules that balance exploration and sequential certification, with implications for quantum-state learning and adaptive information acquisition in noisy, resource-constrained settings.
Abstract
Quantum learning from state samples is often benchmarked in a fixed-budget paradigm, relating error to a prescribed number of copies. We instead adopt a stopping-time viewpoint: in minimal-feedback learning, the learning completion can be defined by an online run-length certificate extracted from a one-bit-per-copy record. As an instantiation, we analyze single-shot measurement learning (SSML), introduced in [Phys. Rev. A 98, 052302 (2018)] and [Phys. Rev. Lett. 126, 170504 (2021)], which tunes a unitary and halts after $M_H$ consecutive successes. Viewing the halting as a sequential certification linking the observed counter to infidelity, we derive sample-complexity bounds that separate search (driving success probability toward unity) from certification (run statistics of consecutive successes). The resulting trade-off among $M_H$, dimension $d$, and one-bit reliability clarifies when performance is control-limited versus certificate-limited. With label-flip noise probability $q$, we find a sharp feasibility threshold: once $qM_H \gtrsim 1$, the expected halting time grows exponentially, making the learning completion impractical even under ideal control. More broadly, this shows that under severely constrained feedback, the certification can dominate sample complexity and small label noise becomes the information bottleneck. Finally, the near-optimal accuracy enabled by run-length certification aligns with the quantum-state-estimation (and equivalently, no-cloning) limits, expressed in the stopping-time terms.
