Discrete generative diffusion models without stochastic differential equations: a tensor network approach
Luke Causer, Grant M. Rotskoff, Juan P. Garrahan
TL;DR
This work develops tensor-network–based discrete diffusion models (DDMs) to sample lattice systems with discrete degrees of freedom without solving stochastic differential equations. By representing both probability vectors $P_{m heta}$ and evolution operators as matrix product states/operators, the forward noising channel $oldsymbol{ ext{W}}$ and the reverse denoising channel $oldsymbol{ ext{W}}^{oldsymbol heta}_{ ilde t}$ can be implemented exactly within TN contractions, and sampling is achieved via autoregressive TN samplers. The authors integrate DDM proposals with MCMC through two update schemes—disconnected and connected—demonstrating that the connected variant yields higher acceptance and controllable correlation through the denoising time $T$. They apply the framework to the Fredkin spin chain and the 2D Ising model on a cylinder, showing how a learnable MPS $P_{m heta}$ can be optimized via negative log-likelihood and gradient-based methods to approximate target Boltzmann distributions efficiently across phases. This approach offers a scalable route to Boltzmann-like sampling on discrete lattices and motivates extensions to higher dimensions and other TN topologies (e.g., TTN, PEPS).
Abstract
Diffusion models (DMs) are a class of generative machine learning methods that sample a target distribution by transforming samples of a trivial (often Gaussian) distribution using a learned stochastic differential equation. In standard DMs, this is done by learning a ``score function'' that reverses the effect of adding diffusive noise to the distribution of interest. Here we consider the generalisation of DMs to lattice systems with discrete degrees of freedom, and where noise is added via Markov chain jump dynamics. We show how to use tensor networks (TNs) to efficiently define and sample such ``discrete diffusion models'' (DDMs) without explicitly having to solve a stochastic differential equation. We show the following: (i) by parametrising the data and evolution operators as TNs, the denoising dynamics can be represented exactly; (ii) the auto-regressive nature of TNs allows to generate samples efficiently and without bias; (iii) for sampling Boltzmann-like distributions, TNs allow to construct an efficient learning scheme that integrates well with Monte Carlo. We illustrate this approach to study the equilibrium of two models with non-trivial thermodynamics, the $d=1$ constrained Fredkin chain and the $d=2$ Ising model.
