A Quantum Network Processor Unit for Distributed Quantum Computing
Peiyi Li, Chenxu Liu, Ji Liu, Huiyang Zhou, Ang Li
TL;DR
This work addresses the scalability bottleneck in distributed quantum computing by proposing a decoupled architecture in which a Quantum Network Processing Unit (QNPU) handles inter-node communication, separate from the local Quantum Processing Unit (QPU). It introduces DistQASM, an extension of OpenQASM for distributed programs, together with a dedicated QNPU ISA and microarchitecture and a cycle-level simulator for evaluation. The key contributions are a four-layer quantum supercomputer stack, a hardware-accelerated communication stack with EPR resource management, and a two-tier microarchitectural design (scalar and superscalar) that speeds up communication-intensive workloads (e.g., BV, QFT, QAOA) by parallelizing remote operations. The results, benchmarked against monolithic designs, indicate that decoupling computation from communication and widening the superscalar QNPU substantially improve distributed quantum performance, supporting scalable and potentially heterogeneous architectures for future quantum supercomputers.
Abstract
As quantum computing progresses, the need for scalable solutions to large-scale computational problems has become critical. Quantum supercomputers, in which multiple quantum processors collaborate to solve such problems, are the next frontier. Their emergence necessitates an efficient interface to manage the quantum communication protocols between quantum processors. In this paper, we propose the Quantum Network Processing Unit (QNPU), which enables quantum applications to scale efficiently beyond the capacity of individual quantum processors and serves as a critical building block for future quantum supercomputers. The QNPU works alongside the Quantum Processing Unit (QPU) in our decoupled processing-unit architecture, where the QPU handles local quantum operations while the QNPU manages quantum communication between nodes. We design a comprehensive instruction set architecture (ISA) for the QNPU with high-level communication protocol abstractions, implemented via micro-operations that manage EPR resources, quantum operations, and classical communication. To facilitate programming, we introduce DistQASM, which extends OpenQASM with distributed quantum operations. We then propose a microarchitecture featuring both scalar and superscalar QNPU designs to enhance performance for communication-intensive quantum workloads. Finally, we evaluate the proposed QNPU design on distributed quantum workloads and demonstrate that the QNPU significantly improves the efficiency of communication between quantum nodes, paving the way for quantum supercomputing.
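To make the idea of a high-level communication instruction "implemented via micro-operations" more concrete, the following Python sketch expands a remote CNOT into micro-operations using the standard one-EPR-pair gate-teleportation protocol. It is only an illustration under stated assumptions: the `MicroOp` class, the `expand_remote_cnot` and `toy_cycles` functions, the micro-operation categories, and the cycle model are hypothetical and are not taken from the paper's ISA, DistQASM, or simulator.

```python
from dataclasses import dataclass
from math import ceil


@dataclass(frozen=True)
class MicroOp:
    """Hypothetical micro-operation record (not the paper's encoding)."""
    node: str    # node that executes the micro-op
    kind: str    # one of: "epr", "gate", "measure", "send"
    detail: str  # human-readable description


def expand_remote_cnot(ctrl_node: str, tgt_node: str) -> list[MicroOp]:
    """Expand a remote CNOT into micro-ops via the textbook telegate protocol.

    The control qubit lives on ctrl_node and the target on tgt_node;
    one shared EPR pair (e_ctrl, e_tgt) is consumed.
    """
    a, b = ctrl_node, tgt_node
    return [
        # EPR resource management: each node reserves its half of the pair.
        MicroOp(a, "epr", f"reserve EPR half e_ctrl shared with {b}"),
        MicroOp(b, "epr", f"reserve EPR half e_tgt shared with {a}"),
        # Entangle the control with the local EPR half and measure it.
        MicroOp(a, "gate", "CNOT ctrl -> e_ctrl"),
        MicroOp(a, "measure", "measure e_ctrl in Z basis -> m1"),
        MicroOp(a, "send", f"classical bit m1 to {b}"),
        # The remote side corrects its half, then applies the local CNOT.
        MicroOp(b, "gate", "X on e_tgt if m1 == 1"),
        MicroOp(b, "gate", "CNOT e_tgt -> tgt"),
        # Disentangle the remote EPR half and report the outcome.
        MicroOp(b, "gate", "H on e_tgt"),
        MicroOp(b, "measure", "measure e_tgt in Z basis -> m2"),
        MicroOp(b, "send", f"classical bit m2 to {a}"),
        MicroOp(a, "gate", "Z on ctrl if m2 == 1"),
    ]


def toy_cycles(num_remote_ops: int, cycles_per_op: int, issue_width: int) -> int:
    """Toy latency model: independent remote ops issue in groups of issue_width.

    This ignores EPR generation contention and data dependencies; it is an
    assumption made for illustration, not the paper's cycle-level model.
    """
    return ceil(num_remote_ops / issue_width) * cycles_per_op


if __name__ == "__main__":
    for op in expand_remote_cnot("A", "B"):
        print(f"{op.node}: [{op.kind}] {op.detail}")
    # Compare a scalar (width 1) and a wider (width 4) issue model.
    print(toy_cycles(8, 10, 1), toy_cycles(8, 10, 4))
```

In this toy model, a width-1 unit serializes every independent remote operation, while a wider unit overlaps them, which is the intuition behind comparing scalar and superscalar QNPU designs; real speedups would depend on EPR availability and program dependencies that the sketch does not capture.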
