Adjusted Scores for Discrete Langevin Algorithms
Armand Gissler, Saeed Saremi, Francis Bach
TL;DR
This work reframes discrete sampling on the hypercube as discretized continuous-time dynamics and introduces Gibbs and Glauber scores to drive four score-based schemes (DULA, DUPS, DMALA, DMAPS). By relating these algorithms to Glauber dynamics and analyzing their contraction and bias, it establishes conditions under which unadjusted and Metropolis-adjusted variants converge to the target $p(\cdot)$ with favorable rates. Theoretical results show small-step guarantees and zero-bias behavior for the Glauber score, while experiments across independent bits, Ising, and Curie–Weiss models demonstrate practical advantages over traditional Gibbs sampling, particularly for moderate-to-large step-sizes. This work thus provides a principled foundation for discrete Langevin methods with provable convergence properties and practical efficacy in discrete probabilistic models.
Abstract
Sampling from discrete distributions is a ubiquitous task in machine learning, recently revisited by the emergence of discrete diffusion models. While Langevin algorithms constitute the state of the art for continuous spaces, discrete versions lack similar theoretical guarantees when the step-size becomes small. In this paper, we address this limitation by interpreting discrete sampling algorithms as discretizations of continuous-time dynamics on the hypercube. In particular, we describe several score functions for discrete algorithms which result in approximations of Glauber dynamics for the correct target distribution. We also compute upper bounds for the contraction of these algorithms, with or without Metropolis adjustment.
