HamilToniQ: An Open-Source Benchmark Toolkit for Quantum Computers
Xiaotian Xu, Kuan-Cheng Chen, Robert Wille
TL;DR
HamilToniQ addresses the lack of standardized, application-oriented benchmarks for quantum processors by offering an open-source toolkit that benchmarks QPUs via QAOA-based tasks and a standardized H-Score that integrates hardware, compilation, and error-mitigation considerations. It uses a ground-truth noiseless reference and a PDF-based scoring framework, enabling self-normalized, cross-system comparisons and support for distributed workloads in Quantum-HPC environments. The authors validate the approach on IBM QPUs, examining topology, error-mitigation protocols, and the benefits of optimized compilation for multi-QPU scenarios. The work provides a scalable, transparent benchmark framework for the full quantum software stack and practical guidance for resource management in future quantum infrastructures.
Abstract
In this paper, we introduce HamilToniQ, an open-source, and application-oriented benchmarking toolkit for the comprehensive evaluation of Quantum Processing Units (QPUs). Designed to navigate the complexities of quantum computations, HamilToniQ incorporates a methodological framework assessing QPU types, topologies, and multi-QPU systems. The toolkit facilitates the evaluation of QPUs' performance through multiple steps including quantum circuit compilation and quantum error mitigation (QEM), integrating strategies that are unique to each stage. HamilToniQ's standardized score, H-Score, quantifies the fidelity and reliability of QPUs, providing a multidimensional perspective of QPU performance. With a focus on the Quantum Approximate Optimization Algorithm (QAOA), the toolkit enables direct, comparable analysis of QPUs, enhancing transparency and equity in benchmarking. Demonstrated in this paper, HamilToniQ has been validated on various IBM QPUs, affirming its effectiveness and robustness. Overall, HamilToniQ significantly contributes to the advancement of the quantum computing field by offering precise and equitable benchmarking metrics.
