Generative Modelling with High-Order Langevin Dynamics
Ziqiang Shi, Rujie Liu
TL;DR
This work introduces HOLD, a fast, high-order diffusion framework that augments the data state with velocity and acceleration via a third-order Langevin forward process. By combining two Hamiltonians with an OU component and employing Block Coordinate Score Matching and Lie-Trotter splitting, HOLD achieves dramatically faster mixing and high-quality sample generation, demonstrated on CIFAR-10 and CelebA-HQ-256 with state-of-the-art or competitive metrics. The approach provides a flexible objective, robust ablations, and a principled likelihood bound, expanding the applicability of score-based diffusion models to richer latent dynamics and multiple data modalities.
Abstract
Diffusion generative modelling (DGM) based on stochastic differential equations (SDEs) with score matching has achieved unprecedented results in data generation. In this paper, we propose a novel fast high-quality generative modelling method based on high-order Langevin dynamics (HOLD) with score matching. This motive is proved by third-order Langevin dynamics. By augmenting the previous SDEs, e.g. variance exploding or variance preserving SDEs for single-data variable processes, HOLD can simultaneously model position, velocity, and acceleration, thereby improving the quality and speed of the data generation at the same time. HOLD is composed of one Ornstein-Uhlenbeck process and two Hamiltonians, which reduce the mixing time by two orders of magnitude. Empirical experiments for unconditional image generation on the public data set CIFAR-10 and CelebA-HQ show that the effect is significant in both Frechet inception distance (FID) and negative log-likelihood, and achieves the state-of-the-art FID of 1.85 on CIFAR-10.
