Generative modeling using evolved quantum Boltzmann machines
Mark M. Wilde
TL;DR
This work provides a practical framework for Born-rule generative modeling using evolved quantum Boltzmann machines by leveraging the Donsker–Varadhan variational representation of relative entropy and the evolved quantum Boltzmann gradient estimator. It formulates a minimax objective with a neural-network feature map and develops four hybrid quantum–classical algorithms (extragradient, two-timescale descent-ascent, follow-the-ridge, HessianFR) for training, along with analytical gradient/Hessian expressions and norm bounds. The approach extends to alternative distinguishability measures such as Rényi relative quasi-entropies and includes a linear-feature-space variant to preserve concavity and improve convergence guarantees. This framework advances practical training of EQBMs for Born-rule sampling and provides a foundation for future convergence analysis and numerical validation.
Abstract
Born-rule generative modeling, a central task in quantum machine learning, seeks to learn probability distributions that can be efficiently sampled by measuring complex quantum states. One hope is for quantum models to efficiently capture probability distributions that are difficult to learn and simulate by classical means alone. Quantum Boltzmann machines were proposed about one decade ago for this purpose, yet efficient training methods have remained elusive. In this paper, I overcome this obstacle by proposing a practical solution that trains quantum Boltzmann machines for Born-rule generative modeling. Two key ingredients in the proposal are the Donsker-Varadhan variational representation of the classical relative entropy and the quantum Boltzmann gradient estimator of [Patel et al., arXiv:2410.12935]. I present the main result for a more general ansatz known as an evolved quantum Boltzmann machine [Minervini et al., arXiv:2501.03367], which combines parameterized real- and imaginary-time evolution. I also show how to extend the findings to other distinguishability measures beyond relative entropy. Finally, I present four different hybrid quantum-classical algorithms for the minimax optimization underlying training, and I discuss their theoretical convergence guarantees.
