Unlocking the Power of Boltzmann Machines by Parallelizable Sampler and Efficient Temperature Estimation
Kentaro Kubo, Hayato Goto
TL;DR
The paper tackles the training bottleneck of energy-based Boltzmann machines by introducing sampler-adaptive learning (SAL), which combines Langevin simulated bifurcation (LSB) for fast parallel sampling with conditional expectation matching (CEM) to estimate the effective inverse temperature during learning. This enables training of semi-restricted Boltzmann machines (SRBMs) and other expressive BMs beyond Restricted Boltzmann Machines (RBMs) by performing gradient-based updates on the KL divergence with a negative phase from LSB samples and a positive phase using β_eff estimated by CEM. Empirical results on a spin-glass model, Bars-and-Stripes images, and OptDigits demonstrate that SAL improves learning efficiency and generative/reconstruction/classification performance, while enabling conditional generation. The framework broadens the practical applicability of energy-based models and points to future extensions with deeper architectures and alternative parallel samplers, along with open theoretical questions about LSB’s probabilistic guarantees and hyperparameter selection.
Abstract
Boltzmann machines (BMs) are powerful energy-based generative models, but their heavy training cost has largely confined practical use to Restricted BMs (RBMs) trained with an efficient learning method called contrastive divergence. More accurate learning typically requires Markov chain Monte Carlo (MCMC) Boltzmann sampling, but it is time-consuming due to the difficulty of parallelization for more expressive models. To address this limitation, we first propose a new Boltzmann sampler inspired by a quantum-inspired combinatorial optimization called simulated bifurcation (SB). This SB-inspired approach, which we name Langevin SB (LSB), enables parallelized sampling while maintaining accuracy comparable to MCMC. Furthermore, this is applicable not only to RBMs but also to BMs with general couplings. However, LSB cannot control the inverse temperature of the output Boltzmann distribution, which hinders learning and degrades performance. To overcome this limitation, we also developed an efficient method for estimating the inverse temperature during the learning process, which we call conditional expectation matching (CEM). By combining LSB and CEM, we establish an efficient learning framework for BMs with greater expressive power than RBMs. We refer to this framework as sampler-adaptive learning (SAL). SAL opens new avenues for energy-based generative modeling beyond RBMs.
