Mitigating Barren plateaus in quantum denoising diffusion probabilistic models
Haipeng Cao, Kaining Zhang, Dacheng Tao, Zhaofeng Su
TL;DR
This work identifies a barren plateau problem in quantum denoising diffusion probabilistic models (QuDDPM) caused by Haar-random inputs to the denoising circuits, which eliminates trainability as system size grows. The authors propose an improved QuDDPM that maintains the forward-diffusion output at a controlled distance from Haar, thereby preserving gradient information during training. They provide theoretical analyses showing gradient lower bounds under proper parameter scaling and validate the approach with experiments across multiple qubit counts, demonstrating improved training dynamics and sample quality. The findings advance scalable quantum generative learning by offering a practical mitigation to barren plateaus in diffusion-based quantum models.
Abstract
Quantum generative models leverage quantum superposition and entanglement to enhance learning efficiency for both classical and quantum data. The quantum denoising diffusion probabilistic model (QuDDPM), inspired by its classical counterpart, has been proposed as a promising framework for quantum generative learning. QuDDPM is capable of efficiently learning and generating quantum data, and it demonstrates excellent performance in learning correlated quantum noise models, quantum many-body phases, and the topological structure of quantum data. However, we show that barren plateaus emerge in QuDDPMs due to the use of 2-design states as the input for the denoising process, which severely undermines the performance of QuDDPM. Through theoretical analysis and experimental validation, we confirm the presence of barren plateaus in the original QuDDPM. To address this issue, we introduce an improved QuDDPM that utilizes a distribution maintaining a certain distance from the Haar distribution, ensuring better trainability. Experimental results demonstrate that our approach effectively mitigates the barren plateau problem and generates samples with higher quality, paving the way for scalable and efficient quantum generative learning.
