Provable Accuracy Bounds for Hybrid Dynamical Optimization and Sampling
Matthew X. Burns, Qingyuan Hou, Michael C. Huang
TL;DR
This work provides the first non-asymptotic probabilistic guarantees for hybrid LNLS using block Langevin diffusion, establishing exponential KL convergence for ideal DXs under a log-Sobolev inequality and quantifying a finite-variation induced bias in the 2-Wasserstein metric. It introduces Randomized and Cyclic Block Langevin Diffusion with explicit constants, and derives non-asymptotic W2 bounds that connect device variation, step size, and noise to performance. Numerical experiments on Gaussian targets illustrate the theory, showing how hyperparameters and analog non-idealities shape convergence and bias, and revealing an equivalence in convergence behavior between randomized and cyclic block strategies. The results offer principled design guidance for hybrid DX systems, enabling principled hyperparameter tuning and a clearer understanding of how analog imperfections affect accuracy in sampling and optimization tasks.
Abstract
Analog dynamical accelerators (DXs) are a growing sub-field in computer architecture research, offering order-of-magnitude gains in power efficiency and latency over traditional digital methods in several machine learning, optimization, and sampling tasks. However, limited-capacity accelerators require hybrid analog/digital algorithms to solve real-world problems, commonly using large-neighborhood local search (LNLS) frameworks. Unlike fully digital algorithms, hybrid LNLS has no non-asymptotic convergence guarantees and no principled hyperparameter selection schemes, particularly limiting cross-device training and inference. In this work, we provide non-asymptotic convergence guarantees for hybrid LNLS by reducing to block Langevin Diffusion (BLD) algorithms. Adapting tools from classical sampling theory, we prove exponential KL-divergence convergence for randomized and cyclic block selection strategies using ideal DXs. With finite device variation, we provide explicit bounds on the 2-Wasserstein bias in terms of step duration, noise strength, and function parameters. Our BLD model provides a key link between established theory and novel computing platforms, and our theoretical results provide a closed-form expression linking device variation, algorithm hyperparameters, and performance.
