Learning Diffusion Priors from Observations by Expectation Maximization
François Rozet, Gérôme Andry, François Lanusse, Gilles Louppe
TL;DR
This work addresses learning high-quality diffusion priors when only noisy, incomplete observations are available. It casts diffusion-prior training as an empirical Bayes problem solved by an Expectation-Maximization framework (DiEM), and it introduces Moment Matching Posterior Sampling (MMPS) to produce accurate posterior samples without destabilizing the diffusion process. The method is validated on a low-dimensional manifold, corrupted CIFAR-10, and accelerated MRI, showing that the learned diffusion priors yield plausible, data-consistent reconstructions and can handle substantial information loss. The approach enables diffusion priors to be used reliably in scientific inverse problems where clean, large datasets are scarce, potentially broadening the applicability of diffusion-based Bayesian inference.
Abstract
Diffusion models recently proved to be remarkable priors for Bayesian inverse problems. However, training these models typically requires access to large amounts of clean data, which could prove difficult in some settings. In this work, we present DiEM, a novel method based on the expectation-maximization algorithm for training diffusion models from incomplete and noisy observations only. Unlike previous works, DiEM leads to proper diffusion models, which is crucial for downstream tasks. As part of our methods, we propose and motivate an improved posterior sampling scheme for unconditional diffusion models. We present empirical evidence supporting the effectiveness of our approach.
