Learned Reference-based Diffusion Sampling for multi-modal distributions
Maxence Noble, Louis Grenioux, Marylou Gabrié, Alain Oliviero Durmus
TL;DR
This work tackles the challenge of sampling from multimodal target densities when only unnormalized densities are available. It introduces Learned Reference-based Diffusion Sampling (LRDS), a two-step approach that first learns a reference diffusion model from high-density region samples and then trains a diffusion-based sampler guided by this reference. LRDS comes in two practical flavors, GMM-LRDS and EBM-LRDS, to accommodate a wide range of target geometries, and outperforms existing diffusion-based samplers on challenging multimodal distributions. The framework connects to Schrödinger bridge and Doob transform perspectives and shows promise for extending diffusion sampling to non-Euclidean spaces with learned references.
Abstract
Over the past few years, several approaches utilizing score-based diffusion have been proposed to sample from probability distributions, that is without having access to exact samples and relying solely on evaluations of unnormalized densities. The resulting samplers approximate the time-reversal of a noising diffusion process, bridging the target distribution to an easy-to-sample base distribution. In practice, the performance of these methods heavily depends on key hyperparameters that require ground truth samples to be accurately tuned. Our work aims to highlight and address this fundamental issue, focusing in particular on multi-modal distributions, which pose significant challenges for existing sampling methods. Building on existing approaches, we introduce Learned Reference-based Diffusion Sampler (LRDS), a methodology specifically designed to leverage prior knowledge on the location of the target modes in order to bypass the obstacle of hyperparameter tuning. LRDS proceeds in two steps by (i) learning a reference diffusion model on samples located in high-density space regions and tailored for multimodality, and (ii) using this reference model to foster the training of a diffusion-based sampler. We experimentally demonstrate that LRDS best exploits prior knowledge on the target distribution compared to competing algorithms on a variety of challenging distributions.
