One-Step Diffusion Samplers via Self-Distillation and Deterministic Flow
Pascal Jutras-Dube, Jiaru Zhang, Ziran Wang, Ruqi Zhang
TL;DR
The paper tackles the inefficiency of sampling from unnormalized densities by introducing Self-Distilled One-Step Diffusion Samplers (OSDS), which learn a step-conditioned probability-flow ODE to emulate many small steps in a single large step. To address the instability of ELBO estimates in few-step regimes, OSDS combines a forward-consistent state distillation with a volume-consistency regularizer and introduces deterministic-flow importance weights that bypass fragile backward kernels. Empirically, OSDS achieves competitive sample quality with orders-of-magnitude fewer network evaluations and provides robust log Z estimates across synthetic and Bayesian benchmarks, including strongly multimodal targets. The work thus offers a practical framework for fast, reliable sampling and evidentiary estimation from unnormalized densities, with potential impact on scalable Bayesian inference and statistical physics simulations.
Abstract
Sampling from unnormalized target distributions is a fundamental yet challenging task in machine learning and statistics. Existing sampling algorithms typically require many iterative steps to produce high-quality samples, leading to high computational costs. We introduce one-step diffusion samplers which learn a step-conditioned ODE so that one large step reproduces the trajectory of many small ones via a state-space consistency loss. We further show that standard ELBO estimates in diffusion samplers degrade in the few-step regime because common discrete integrators yield mismatched forward/backward transition kernels. Motivated by this analysis, we derive a deterministic-flow (DF) importance weight for ELBO estimation without a backward kernel. To calibrate DF, we introduce a volume-consistency regularization that aligns the accumulated volume change along the flow across step resolutions. Our proposed sampler therefore achieves both sampling and stable evidence estimate in only one or few steps. Across challenging synthetic and Bayesian benchmarks, it achieves competitive sample quality with orders-of-magnitude fewer network evaluations while maintaining robust ELBO estimates.
