Diabatic quantum annealing for training energy-based generative models
Gilhan Kim, Ju-Yeon Gyhm, Daniel K. Park
TL;DR
RBMs require unbiased Boltzmann samples, but classical sampling is slow and produces correlated data. The authors apply diabatic quantum annealing (DQA) and its analytic relation between annealing schedules and the effective inverse temperature $\beta_{\mathrm{integral}}$ to generate calibrated Boltzmann samples for RBM training, enabling principled sampling without post hoc fitting. On a D-Wave device, DQA-based RBM training achieves faster convergence and lower validation error than persistent CD, while exposing a hardware-induced temperature misalignment that is corrected by an analytic rescaling factor $\alpha$. This calibration improves sampling fidelity and demonstrates the practicality and scalability potential of quantum-assisted Boltzmann sampling for energy-based models, with extensions to fully connected Boltzmann machines and gate-based implementations discussed for the future.
Abstract
Energy-based generative models, such as restricted Boltzmann machines (RBMs), require unbiased Boltzmann samples for effective training. Classical Markov chain Monte Carlo methods, however, converge slowly and yield correlated samples, making large-scale training difficult. We address this bottleneck by applying the analytic relation between annealing schedules and effective inverse temperature in diabatic quantum annealing. By implementing this prescription on a quantum annealer, we obtain temperature-controlled Boltzmann samples that enable RBM training with faster convergence and lower validation error than classical sampling. We further identify a systematic temperature misalignment intrinsic to analog quantum computers and propose an analytical rescaling method that mitigates this hardware noise, thereby enhancing the practicality of quantum annealers as Boltzmann samplers. In our method, the model's connectivity is set directly by the qubit connectivity, transforming the computational complexity inherent in classical sampling into a requirement on quantum hardware. This shift allows the approach to extend naturally from RBMs to fully connected Boltzmann machines, opening opportunities inaccessible to classical training methods.
