Table of Contents
Fetching ...

Accelerating Decision Diagram-based Multi-node Quantum Simulation with Ring Communication and Automatic SWAP Insertion

Yusuke Kimura, Shaowen Li, Hiroyuki Sato, Masahiro Fujita

TL;DR

The paper tackles the memory bottleneck of statevector quantum simulation by leveraging decision diagram representations (QMDD) to compress state and gate data. It introduces a multi-node DD-based simulator that uses ring communication and automatic SWAP insertion to reduce inter-node data exchange and improve parallelism. Empirical results on Shor and QCBM circuits show up to 26× runtime reductions and demonstrate when ring topology and swap strategies yield the greatest gains. This work extends scalable quantum simulation on distributed hardware, guiding algorithm- and circuit-dependent choices for memory-efficient, high-performance modeling.

Abstract

An N-bit quantum state requires a vector of length $2^N$, leading to an exponential increase in the required memory with N in conventional statevector-based quantum simulators. A proposed solution to this issue is the decision diagram-based quantum simulator, which can significantly decrease the necessary memory and is expected to operate faster for specific quantum circuits. However, decision diagram-based quantum simulators are not easily parallelizable because data must be manipulated dynamically, and most implementations run on one thread. This paper introduces ring communication-based optimal parallelization and automatic swap insertion techniques for multi-node implementation of decision diagram-based quantum simulators. The ring communication approach is designed so that each node communicates with its neighboring nodes, which can facilitate faster and more parallel communication than broadcasting where one node needs to communicate with all nodes simultaneously. The automatic swap insertion method, an approach to minimize inter-node communication, has been employed in existing multi-node state vector-based simulators, but this paper proposes two methods specifically designed for decision diagram-based quantum simulators. These techniques were implemented and evaluated using the Shor algorithm and random circuits with up to 38 qubits using a maximum of 256 nodes. The experimental results have revealed that multi-node implementation can reduce run-time by up to 26 times. For example, Shor circuits that need 38 qubits can finish simulation in 147 seconds. Additionally, it was shown that ring communication has a higher speed-up effect than broadcast communication, and the importance of selecting the appropriate automatic swap insertion method was revealed.

Accelerating Decision Diagram-based Multi-node Quantum Simulation with Ring Communication and Automatic SWAP Insertion

TL;DR

The paper tackles the memory bottleneck of statevector quantum simulation by leveraging decision diagram representations (QMDD) to compress state and gate data. It introduces a multi-node DD-based simulator that uses ring communication and automatic SWAP insertion to reduce inter-node data exchange and improve parallelism. Empirical results on Shor and QCBM circuits show up to 26× runtime reductions and demonstrate when ring topology and swap strategies yield the greatest gains. This work extends scalable quantum simulation on distributed hardware, guiding algorithm- and circuit-dependent choices for memory-efficient, high-performance modeling.

Abstract

An N-bit quantum state requires a vector of length , leading to an exponential increase in the required memory with N in conventional statevector-based quantum simulators. A proposed solution to this issue is the decision diagram-based quantum simulator, which can significantly decrease the necessary memory and is expected to operate faster for specific quantum circuits. However, decision diagram-based quantum simulators are not easily parallelizable because data must be manipulated dynamically, and most implementations run on one thread. This paper introduces ring communication-based optimal parallelization and automatic swap insertion techniques for multi-node implementation of decision diagram-based quantum simulators. The ring communication approach is designed so that each node communicates with its neighboring nodes, which can facilitate faster and more parallel communication than broadcasting where one node needs to communicate with all nodes simultaneously. The automatic swap insertion method, an approach to minimize inter-node communication, has been employed in existing multi-node state vector-based simulators, but this paper proposes two methods specifically designed for decision diagram-based quantum simulators. These techniques were implemented and evaluated using the Shor algorithm and random circuits with up to 38 qubits using a maximum of 256 nodes. The experimental results have revealed that multi-node implementation can reduce run-time by up to 26 times. For example, Shor circuits that need 38 qubits can finish simulation in 147 seconds. Additionally, it was shown that ring communication has a higher speed-up effect than broadcast communication, and the importance of selecting the appropriate automatic swap insertion method was revealed.
Paper Structure (22 sections, 3 equations, 12 figures, 2 tables)

This paper contains 22 sections, 3 equations, 12 figures, 2 tables.

Figures (12)

  • Figure 1: Statevector in a decision diagram
  • Figure 2: Quantum gate in decision diagrams
  • Figure 3: Statevector distribution
  • Figure 4: Unitary matrix of two gates
  • Figure 5: SWAP gate & MPI communication
  • ...and 7 more figures