Observability Architecture for Quantum-Centric Supercomputing Workflows
Naoki Kanazawa, Yuto Morohoshi, Hitomi Takahashi, Yukio Kawashima, Hiroshi Horii, Kengo Nakajima
TL;DR
This work tackles the challenge of observing quantum-centric supercomputing (QCSC) workflows, which combine probabilistic quantum kernels with large-scale classical orchestration and remote hardware. It proposes an application-level observability architecture organized around a workflow metrics pyramid, decoupled telemetry processing, and persistent storage to enable retrospective analysis without affecting primary execution. The authors implement this architecture on the Miyabi supercomputer and IBM Quantum systems, using Prefect for workflow orchestration and Apache Superset for dashboards, and demonstrate its utility through a closed-loop SQD workflow that employs differential evolution to study a [4Fe-4S] chemistry Hamiltonian. The results show how domain-level telemetry reveals solver dynamics and resource bottlenecks, enabling infrastructure-aware algorithm design and systematic experimentation in QCSC environments.
Abstract
Quantum-centric supercomputing (QCSC) workflows often involve hybrid classical-quantum algorithms that are inherently probabilistic and executed on remote quantum hardware, making them difficult to interpret and limiting the ability to monitor runtime performance and behavior. The high cost of quantum circuit execution and large-scale high-performance computing (HPC) infrastructure further restricts the number of feasible trials, making comprehensive evaluation of execution results essential for iterative development. We propose an observability architecture tailored for QCSC workflows that decouples telemetry collection from workload execution, enabling persistent monitoring across system and algorithmic layers and retaining detailed execution data for reproducible and retrospective analysis, eliminating redundant runs. Applied to a representative workflow involving sample-based quantum diagonalization, our system reveals solver behavior across multiple iterations. This approach enhances transparency and reproducibility in QCSC environments, supporting infrastructure-aware algorithm design and systematic experimentation.
