COMPAS: A Distributed Multi-Party SWAP Test for Parallel Quantum Algorithms
Brayden Goldstein-Gelb, Kun Liu, John M. Martyn, Hengyun Zhou, Yongshan Ding, Yuan Liu
TL;DR
COMPAS delivers a co-designed hardware/software approach for distributed quantum computation by implementing a constant-depth, Bell-pair–assisted multivariate trace estimation via a distributed multi-party SWAP test. It provides two two-party CSWAP realizations (telegate and teledata) and a parallel Toffoli via Fanout to preserve $O(1)$ depth while scaling Bell-pair usage as $O(nk)$. The work includes detailed circuit- and network-level error analyses and demonstrates broad applicability to Rényi entropy estimation, entanglement spectroscopy, virtual cooling/distillation, and distributed QSP, highlighting practical prospects for near-term distributed quantum hardware. It also discusses open challenges such as error correction, network topology, and Bell-pair distillation overhead, outlining a path toward scalable, architecture-aware distributed quantum algorithms. Overall, COMPAS demonstrates a viable route to performing sophisticated quantum primitives in distributed settings with controlled resource costs and fidelity losses.
Abstract
The limited number of qubits per chip remains a critical bottleneck in quantum computing, motivating the use of distributed architectures that interconnect multiple quantum processing units (QPUs). However, executing quantum algorithms across distributed systems requires careful co-design of algorithmic primitives and hardware architectures to manage circuit depth and entanglement overhead. We identify multivariate trace estimation as a key subroutine that is naturally suited for distribution, and broadly useful in tasks such as estimating Rényi entropies, virtual cooling and distillation, and certain applications of quantum signal processing. In this work, we introduce COMPAS, an architecture that realizes multivariate trace estimation across a multi-party network of interconnected modular and distributed QPUs by leveraging pre-shared entangled Bell pairs as resources. COMPAS adds only a constant depth overhead and consumes Bell pairs at a rate linear in circuit width, making it suitable for near-term hardware. Unlike other schemes, which must choose between asymptotic optimality in circuit depth or GHZ width, COMPAS achieves both at once. Additionally, we analyze network-level errors and simulate the effects of circuit-level noise on the architecture.
