Sampling from Binary Quadratic Distributions via Stochastic Localization
Chenguang Wang, Kaiyuan Cui, Weichen Zhao, Tianshu Yu
TL;DR
This work addresses the challenge of sampling from binary quadratic distributions (BQDs) by applying stochastic localization (SL) to construct time-evolving posteriors that become increasingly easy to sample from as SL progresses. The authors prove that after a finite number of iterations the induced external field $h_t$ grows to infinity with high probability, which implies Poincaré inequalities and polynomial-time mixing for a broad class of discrete MCMC samplers, including Glauber dynamics and Metropolis-Hastings variants. The framework is demonstrated both theoretically and empirically: it covers warm-up and advanced DMCMC algorithms and yields consistent sampling improvements on MIS, MaxClique, and MaxCut across multiple datasets. This work thereby offers a principled, generalizable approach to accelerate discrete sampling tasks and broadens the applicability of SL in discrete domains, with potential impact on statistical physics, combinatorial optimization, and related machine learning applications.
Abstract
Sampling from binary quadratic distributions (BQDs) is a fundamental but challenging problem in discrete optimization and probabilistic inference. Previous work established theoretical guarantees for stochastic localization (SL) in continuous domains, where MCMC methods efficiently estimate the required posterior expectations during SL iterations. However, achieving similar convergence guarantees for discrete MCMC samplers in posterior estimation presents unique theoretical challenges. In this work, we present the first application of SL to general BQDs, proving that after a certain number of iterations, the external field of posterior distributions constructed by SL tends to infinity almost everywhere, hence satisfy Poincaré inequalities with probability near to 1, leading to polynomial-time mixing. This theoretical breakthrough enables efficient sampling from general BQDs, even those that may not originally possess fast mixing properties. Furthermore, our analysis, covering enormous discrete MCMC samplers based on Glauber dynamics and Metropolis-Hastings algorithms, demonstrates the broad applicability of our theoretical framework. Experiments on instances with quadratic unconstrained binary objectives, including maximum independent set, maximum cut, and maximum clique problems, demonstrate consistent improvements in sampling efficiency across different discrete MCMC samplers.
