Inference-Time Scaling of Discrete Diffusion Models via Importance Weighting and Optimal Proposal Design
Zijing Ou, Chinmay Pani, Yingzhen Li
TL;DR
The paper tackles the challenge of aligning outputs of discrete diffusion models with downstream constraints at inference time. It introduces a Sequential Monte Carlo (SMC) framework that yields tractable importance weights for intermediate targets and explores two practical optimal-proposal approximations: a first-order gradient-based method and an amortised proposal that minimises the log-variance of the weights. The authors derive tractable weights for product targets and reward-tilting, propose a twisted intermediate target with annealing, and empirically validate the approach across synthetic data, language modelling, biology design, and image generation, showing improved controllability and sample quality. This work provides a versatile, scalable recipe for test-time scaling and post-training alignment of discrete diffusion models, with broad applicability across domains and tasks.
Abstract
Discrete diffusion models have become highly effective across various domains. However, real-world applications often require the generative process to adhere to certain constraints. To this end, we propose a Sequential Monte Carlo (SMC) framework that enables scalable inference-time control of discrete diffusion models through principled importance weighting and optimal proposal construction. Specifically, our approach derives tractable importance weights for a range of intermediate targets and characterises the optimal proposal, for which we develop two practical approximations: a first-order gradient-based approximation and an amortised proposal trained to minimise the log-variance of the importance weights. Empirical results across synthetic tasks, language modelling, biology design, and text-to-image generation demonstrate that our framework enhances controllability and sample quality, highlighting the effectiveness of SMC as a versatile recipe for scaling discrete diffusion models at inference time.
