Boltzmann Reinforcement Learning for Noise resilience in Analog Ising Machines
Aditya Choudhary, Saaketh Desai, Prasad Iyer
TL;DR
This work addresses noise-limited sampling and optimization on Analog Ising Machines by reframing the problem as distribution learning. It introduces BRAIN, a variational reinforcement approach that learns a factorized Bernoulli sampler $q_\theta(x)$ to approximate the Boltzmann distribution $p(x) = \frac{e^{-\beta E(x)}}{Z}$ using noisy energy measurements. By applying REINFORCE with baselines and keeping the sampler lightweight, BRAIN achieves high ground-state fidelity under realistic noise (e.g., 3% relative energy noise) and significantly accelerates solution time compared with MCMC, while scaling to tens of thousands of spins ($\mathcal{O}(N^{1.55})$). The method also captures thermodynamic phase transitions and metastable states across topologies, including Curie–Weiss and Lenz–Ising models, demonstrating robustness to measurement uncertainty up to 40%. This approach holds promise for enabling robust, low-latency optimization on hardware that inherently exhibits stochastic energy evaluations.
Abstract
Analog Ising machines (AIMs) have emerged as a promising paradigm for combinatorial optimization, utilizing physical dynamics to solve Ising problems with high energy efficiency. However, the performance of traditional optimization and sampling algorithms on these platforms is often limited by inherent measurement noise. We introduce BRAIN (Boltzmann Reinforcement for Analog Ising Networks), a distribution learning framework that utilizes variational reinforcement learning to approximate the Boltzmann distribution. By shifting from state-by-state sampling to aggregating information across multiple noisy measurements, BRAIN is resilient to Gaussian noise characteristic of AIMs. We evaluate BRAIN across diverse combinatorial topologies, including the Curie-Weiss and 2D nearest-neighbor Ising systems. We find that under realistic 3\% Gaussian measurement noise, BRAIN maintains 98\% ground state fidelity, whereas Markov Chain Monte Carlo (MCMC) methods degrade to 51\% fidelity. Furthermore, BRAIN reaches the MCMC-equivalent solution up to 192x faster under these conditions. BRAIN exhibits $\mathcal{O}(N^{1.55})$ scaling up to 65,536 spins and maintains robustness against severe measurement uncertainty up to 40\%. Beyond ground state optimization, BRAIN accurately captures thermodynamic phase transitions and metastable states, providing a scalable and noise-resilient method for utilizing analog computing architectures in complex optimizations.
