Noise-Adaptive Quantum Circuit Mapping for Multi-Chip NISQ Systems via Deep Reinforcement Learning
Atiye Zeynali, Zahra Bakhshi
TL;DR
DeepQMap reframes quantum circuit mapping for multi‑chip NISQ systems as a noise‑aware sequential decision problem, using a Bidirectional LSTM Dynamic Noise Adaptation (DNA) network to forecast short‑term hardware noise and a multi‑head attention module to capture long‑range qubit dependencies. The Rainbow DQN framework integrates prioritized replay, dueling networks, and multi‑step returns to learn robust, scalable mapping policies that minimize inter‑chip communication while maximizing fidelity. Empirical evaluation on 270 benchmarks spanning QFT, Grover, and VQE demonstrates a 49.3% fidelity improvement over QUBO baselines, a 79.8% reduction in inter‑chip operations, and an 8.2× faster training time, with sustained performance up to 100 qubits. The approach generalizes across circuit families and hardware topologies, offering practical, scalable improvements for near‑term quantum computing. These results indicate that predictive noise modeling combined with structured RL can significantly enhance real‑world quantum compilation and control.
Abstract
The transition from monolithic to distributed multi-chip quantum architectures has fundamentally altered the circuit compilation landscape, introducing challenges in managing temporal noise variations and minimizing expensive inter-chip operations. We present DeepQMap, a deep reinforcement learning framework that integrates a bidirectional Long Short-Term Memory based Dynamic Noise Adaptation (DNA) network with multi-head attention mechanisms and Rainbow DQN architecture. Unlike conventional static optimization approaches such as QUBO formulations, our method continuously adapts to hardware dynamics through learned temporal representations of quantum system behavior. Comprehensive evaluation across 270 benchmark circuits spanning Quantum Fourier Transform, Grover's algorithm, and Variational Quantum Eigensolver demonstrates that DeepQMap achieves mean circuit fidelity of $0.920 \pm 0.023$, representing a statistically significant 49.3\% improvement over state-of-the-art QUBO methods ($0.618 \pm 0.031$, $t_{98} = 4.87$, $p = 0.0023$, Cohen's $d = 2.34$). Inter-chip communication overhead reduces by 79.8\%, decreasing from 2.34 operations per circuit to 0.47. The DNA network maintains noise prediction accuracy with coefficient of determination $R^2 = 0.912$ and mean absolute error of 0.87\%, enabling proactive compensation for hardware fluctuations. Scalability analysis confirms sustained performance across 20-100 qubit systems, with fidelity remaining above 0.87 even at maximum scale where competing methods degrade below 0.60. Training convergence occurs 8.2$\times$ faster than baseline approaches, completing in 45 minutes versus 370 minutes for QUBO optimization. Very large effect sizes validate practical significance for near-term noisy intermediate-scale quantum computing applications.
