Data Verification is the Future of Quantum Computing Copilots
Junhao Song, Ziqian Bi, Xinliang Chia, William Knottenbelt, Yudong Cao
TL;DR
The paper argues that quantum copilots require data verification as a minimum, due to binary correctness and an exponentially sparse space of valid designs where purely statistical learning fails. It defines verification-aware data, a priori constraints, and a verification-first architectural paradigm, and demonstrates through a Cuccaro Adder case that relying on statistics alone leads to infeasibility. Empirical results across 34 LLMs show verified-data models achieve substantially higher accuracy (0.60–0.79) and better calibration than unverified copilots, while formal verification reveals a vast design space with only a tiny fraction valid, highlighting the inefficiency of post-hoc filtering. The work advocates constraint-rich benchmarks and the explicit integration of verification into generation loops, with broad applicability to other physics- and math-constrained scientific domains.
Abstract
Quantum program generation demands a level of precision that may not be compatible with the statistical reasoning carried out in the inference of large language models (LLMs). Hallucinations are mathematically inevitable and not addressable by scaling, which leads to infeasible solutions. We argue that architectures prioritizing verification are necessary for quantum copilots and AI automation in domains governed by constraints. Our position rests on three key points: verified training data enables models to internalize precise constraints as learned structures rather than statistical approximations; verification must constrain generation rather than filter outputs, as valid designs occupy exponentially shrinking subspaces; and domains where physical laws impose correctness criteria require verification embedded as architectural primitives. Early experiments showed LLMs without data verification could only achieve a maximum accuracy of 79% in circuit optimization. Our positions are formulated as quantum computing and AI4Research community imperatives, calling for elevating verification from afterthought to architectural foundation in AI4Research.
