Quantum Circuit Mutants: Empirical Analysis and Recommendations
Eñaut Mendiluze Usandizaga, Tao Yue, Paolo Arcaini, Shaukat Ali
TL;DR
The paper tackles the challenge of evaluating quantum software testing by conducting a large-scale empirical study of over 700K quantum circuit mutants generated from 382 real circuits. Using Muskit/QMutPy-based mutation, it analyzes how mutation operators, circuit attributes, and algorithm types influence fault survivability, and introduces a dataset and a recommendation tool to help researchers and practitioners create mutants with controllable difficulty. Key findings show Add mutations and faults at circuit ends tend to survive longer, while dominant-output algorithms generally exhibit higher survivability; circuit complexity shows limited predictive value for survivability. The work provides practical benchmarks and a scalable framework for systematic quantum mutation analysis, with implications for designing cost-effective testing strategies and guiding future research toward real-hardware validation and equivalence reduction methods.
Abstract
As a new research area, quantum software testing lacks systematic testing benchmarks to assess testing techniques' effectiveness. Recently, some open-source benchmarks and mutation analysis tools have emerged. However, there is insufficient evidence on how various quantum circuit characteristics (e.g., circuit depth, number of quantum gates), algorithms (e.g., Quantum Approximate Optimization Algorithm), and mutation characteristics (e.g., mutation operators) affect the detection of mutants in quantum circuits. Studying such relations is important to systematically design faulty benchmarks with varied attributes (e.g., the difficulty in detecting a seeded fault) to facilitate assessing the cost-effectiveness of quantum software testing techniques efficiently. To this end, we present a large-scale empirical evaluation with more than 700K faulty benchmarks (quantum circuits) generated by mutating 382 real-world quantum circuits. Based on the results, we provide valuable insights for researchers to define systematic quantum mutation analysis techniques. We also provide a tool to recommend mutants to users based on chosen characteristics (e.g., a quantum algorithm type) and the required difficulty of detecting mutants. Finally, we also provide faulty benchmarks that can already be used to assess the cost-effectiveness of quantum software testing techniques.
