Split Gibbs Discrete Diffusion Posterior Sampling
Wenda Chu, Zihui Wu, Yifan Chen, Yang Song, Yisong Yue
TL;DR
This work tackles posterior sampling in discrete-state spaces by enabling plug-and-play conditioning with discrete diffusion priors. It introduces Split Gibbs Discrete Diffusion Posterior Sampling (SGDD), an alternating likelihood-prior scheme augmented by an auxiliary variable ${\mathbf{z}}$ and a regularization $D({\mathbf{x}},{\mathbf{z}};\eta)$, with convergence guarantees as $\eta\to0$. A theoretical bound based on KL divergence and relative Fisher information establishes convergence to the true posterior under realistic assumptions, while empirical results across synthetic data, DNA design, discrete image inpainting, and monophonic music infilling show consistent performance gains over baselines. The method enables reward-guided generation and solving inverse problems in discrete spaces, offering a principled, scalable, plug-and-play framework with available code for community use and extension.
Abstract
We study the problem of posterior sampling in discrete-state spaces using discrete diffusion models. While posterior sampling methods for continuous diffusion models have achieved remarkable progress, analogous methods for discrete diffusion models remain challenging. In this work, we introduce a principled plug-and-play discrete diffusion posterior sampling algorithm based on split Gibbs sampling, which we call SGDD. Our algorithm enables reward-guided generation and solving inverse problems in discrete-state spaces. We demonstrate the convergence of SGDD to the target posterior distribution and verify this through controlled experiments on synthetic benchmarks. Our method enjoys state-of-the-art posterior sampling performance on a range of benchmarks for discrete data, including DNA sequence design, discrete image inverse problems, and music infilling, achieving more than 30% improved performance compared to existing baselines. Our code is available at https://github.com/chuwd19/Split-Gibbs-Discrete-Diffusion-Posterior-Sampling.
