Learning few-step posterior samplers by unfolding and distillation of diffusion models

Charlesquin Kemajou Mbakam; Jonathan Spence; Marcelo Pereyra

Learning few-step posterior samplers by unfolding and distillation of diffusion models

Charlesquin Kemajou Mbakam, Jonathan Spence, Marcelo Pereyra

TL;DR

The paper tackles Bayesian image reconstruction under ill-posed forward models by leveraging diffusion-model priors. It introduces UD$^2$Ms, a framework that unfolds the LATINO Langevin sampler into a trainable, few-step conditional diffusion model via deep unfolding and LoRA-based distillation, allowing joint handling of multiple likelihoods at inference. By sampling from $p(\mathbf{x}_0|\mathbf{y},\mathbf{x}_t)$ through a learned proximal operator and a pre-trained DM prior, UD$^2$Ms achieve high accuracy with around $\mathcal{O}(10)$ neural function evaluations, while preserving flexibility to adapt to different forward models at test time. Extensive experiments on Gaussian/uniform/motion deblurring, inpainting, SR, and JPEG artifact removal on ImageNet and LSUN demonstrate strong PSNR/LPIPS/FID gains and robust generalization, with ablations showing the benefits of unfolding depth, initialization, and LoRA rank. Overall, the approach merges the advantages of distillation and PnP strategies to deliver efficient, accurate posterior sampling for diverse inverse problems in computational imaging.

Abstract

Diffusion models (DMs) have emerged as powerful image priors in Bayesian computational imaging. Two primary strategies have been proposed for leveraging DMs in this context: Plug-and-Play methods, which are zero-shot and highly flexible but rely on approximations; and specialized conditional DMs, which achieve higher accuracy and faster inference for specific tasks through supervised training. In this work, we introduce a novel framework that integrates deep unfolding and model distillation to transform a DM image prior into a few-step conditional model for posterior sampling. A central innovation of our approach is the unfolding of a Markov chain Monte Carlo (MCMC) algorithm - specifically, the recently proposed LATINO Langevin sampler (Spagnoletti et al., 2025) - representing the first known instance of deep unfolding applied to a Monte Carlo sampling scheme. We demonstrate our proposed unfolded and distilled samplers through extensive experiments and comparisons with the state of the art, where they achieve excellent accuracy and computational efficiency, while retaining the flexibility to adapt to variations in the forward model at inference time.

Learning few-step posterior samplers by unfolding and distillation of diffusion models

TL;DR

The paper tackles Bayesian image reconstruction under ill-posed forward models by leveraging diffusion-model priors. It introduces UD

Ms, a framework that unfolds the LATINO Langevin sampler into a trainable, few-step conditional diffusion model via deep unfolding and LoRA-based distillation, allowing joint handling of multiple likelihoods at inference. By sampling from

through a learned proximal operator and a pre-trained DM prior, UD

Ms achieve high accuracy with around

neural function evaluations, while preserving flexibility to adapt to different forward models at test time. Extensive experiments on Gaussian/uniform/motion deblurring, inpainting, SR, and JPEG artifact removal on ImageNet and LSUN demonstrate strong PSNR/LPIPS/FID gains and robust generalization, with ablations showing the benefits of unfolding depth, initialization, and LoRA rank. Overall, the approach merges the advantages of distillation and PnP strategies to deliver efficient, accurate posterior sampling for diverse inverse problems in computational imaging.

Learning few-step posterior samplers by unfolding and distillation of diffusion models

TL;DR

Abstract

Learning few-step posterior samplers by unfolding and distillation of diffusion models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)