Analyzing and Guiding Zero-Shot Posterior Sampling in Diffusion Models

Roi Benita; Michael Elad; Joseph Keshet

TL;DR

This work analyzes zero-shot posterior sampling in diffusion models for linear inverse problems through a spectral lens, under a Gaussian prior. It derives closed-form expressions for both the ideal posterior sampler and training-free reconstruction methods, enabling principled comparisons in the spectral domain and a method-agnostic framework for weighting guidance terms. The authors formulate an optimization problem based on an averaged Wasserstein distance to design optimal guidance weights, and provide closed-form, per-frequency transfer functions that decouple across frequencies. Empirical results on synthetic Gaussian data and real datasets (FFHQ and ImageNet) show that the spectral recommendations offer a more balanced trade-off between measurement fidelity and perceptual quality than common heuristics, while reducing per-instance tuning and adapting to the diffusion step size.
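To make the per-frequency decoupling concrete, here is a minimal sketch of the squared Wasserstein-2 distance between two Gaussians whose covariances are diagonal in a shared (spectral) basis. The closed form used is the standard one for commuting covariances; the function name `w2_squared_diag` and the toy inputs are illustrative assumptions, and the paper's actual averaged objective may differ in its details.

```python
import numpy as np

def w2_squared_diag(mean_a, var_a, mean_b, var_b):
    """Squared W2 distance between N(mean_a, diag(var_a)) and N(mean_b, diag(var_b)).

    Assumes both covariances are diagonal in the same basis, so the
    distance is a sum of independent per-frequency terms.
    Returns (total, per_frequency_terms).
    """
    mean_a = np.asarray(mean_a, dtype=float)
    var_a = np.asarray(var_a, dtype=float)
    mean_b = np.asarray(mean_b, dtype=float)
    var_b = np.asarray(var_b, dtype=float)
    # Per-frequency term: mean mismatch plus standard-deviation mismatch.
    per_freq = (mean_a - mean_b) ** 2 + (np.sqrt(var_a) - np.sqrt(var_b)) ** 2
    return per_freq.sum(), per_freq

# Hypothetical two-frequency example.
total, per_freq = w2_squared_diag([0.0, 0.0], [1.0, 4.0], [1.0, 0.0], [1.0, 1.0])
print(total, per_freq)
```

Because each term depends on a single frequency only, minimizing such a distance over per-frequency parameters can be done one frequency at a time.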

Abstract

Recovering a signal from its degraded measurements is a long-standing challenge in science and engineering. Recently, zero-shot diffusion-based methods have been proposed for such inverse problems, offering a posterior-sampling-based solution that leverages prior knowledge. Such algorithms incorporate the observations during inference, often leaning on manual tuning and heuristics. In this work we propose a rigorous analysis of such approximate posterior samplers, relying on a Gaussianity assumption on the prior. Under this regime, we show that both the ideal posterior sampler and diffusion-based reconstruction algorithms can be expressed in closed form, enabling their thorough analysis and comparison in the spectral domain. Building on these representations, we also introduce a principled framework for parameter design, replacing the heuristic selection strategies used to date. The proposed approach is method-agnostic and yields tailored parameter choices for each algorithm, jointly accounting for the characteristics of the prior, the degraded signal, and the diffusion dynamics. We show that our spectral recommendations differ structurally from standard heuristics and vary with the diffusion step size, resulting in a consistent balance between perceptual quality and signal fidelity.
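As an illustration of why a Gaussian prior makes the ideal posterior tractable, the sketch below computes the exact posterior of a linear inverse problem via standard Gaussian conditioning. It assumes white measurement noise with standard deviation `sigma_y`; that noise model, the function name `gaussian_posterior`, and the toy dimensions are assumptions made for illustration, not details taken from the paper.

```python
import numpy as np

def gaussian_posterior(mu0, Sigma0, H, y, sigma_y):
    """Posterior of x0 ~ N(mu0, Sigma0) given y = H x0 + n, n ~ N(0, sigma_y^2 I).

    Returns the posterior mean and covariance (standard Gaussian conditioning).
    """
    m = H.shape[0]
    # Covariance of the measurement y.
    S = H @ Sigma0 @ H.T + sigma_y**2 * np.eye(m)
    # Gain K = Sigma0 H^T S^{-1}, computed with a solve (S is symmetric).
    K = np.linalg.solve(S, H @ Sigma0).T
    mean = mu0 + K @ (y - H @ mu0)
    cov = Sigma0 - K @ H @ Sigma0
    return mean, cov

# Hypothetical toy problem: observe only the first of two coordinates.
mu0 = np.zeros(2)
Sigma0 = np.array([[1.0, 0.5], [0.5, 1.0]])
H = np.array([[1.0, 0.0]])
y = np.array([2.0])
mean, cov = gaussian_posterior(mu0, Sigma0, H, y, sigma_y=1.0)
print(mean)
print(cov)
```

Sampling from `N(mean, cov)` then gives exact posterior samples in this Gaussian setting, which is the kind of reference an approximate sampler can be compared against.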

Paper Structure (37 sections, 3 theorems, 180 equations, 12 figures, 5 tables)

Introduction
Background
Linear Inverse Problems
Diffusion Models
Training-free diffusion-based methods for inverse problems
Guided zero-shot posterior samplers
The Optimal Denoiser for a Gaussian Prior
Reverse Process Formulation
Migrating to spectral domain
Optimal Posterior Sampling Weights
Optimal Gaussian Posterior Sampling
The Posterior Optimal Denoiser
Reverse Process Formulation
Migrating to spectral domain
Related work
...and 22 more sections

Key Result

Theorem 4.1

Let $\mathbf{x}_0 \sim \mathcal{N}(\boldsymbol{\mu}_0, \boldsymbol{\Sigma}_0)$ and let $\mathbf{x}_t$ denote the noisy signal obtained from the forward diffusion process, as defined in eq:marginal_dist. Given the linear measurement model $\mathbf{y}=\mathbf{H}\mathbf{x}_0+\mathbf{n}$, the denoised s[…] admits the following closed form: […]. A detailed proof is given in Appendix sec:appendix_Map_est.
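The theorem's own expression is not reproduced on this page. As a hedged illustration of what a measurement-aware denoiser looks like in the Gaussian case, the sketch below computes $\mathbb{E}[\mathbf{x}_0 \mid \mathbf{x}_t, \mathbf{y}]$ by standard Gaussian conditioning. It assumes a variance-preserving forward marginal $\mathbf{x}_t = \sqrt{\bar{\alpha}_t}\,\mathbf{x}_0 + \sqrt{1-\bar{\alpha}_t}\,\boldsymbol{\epsilon}$ and white measurement noise with standard deviation `sigma_y`, independent of $\boldsymbol{\epsilon}$. These assumptions, and the names `posterior_denoiser` and `alpha_bar_t`, are illustrative and may not match the paper's notation or its Theorem 4.1.

```python
import numpy as np

def posterior_denoiser(x_t, y, mu0, Sigma0, H, sigma_y, alpha_bar_t):
    """E[x0 | x_t, y] for x0 ~ N(mu0, Sigma0), under the assumed models
    x_t = sqrt(alpha_bar_t) x0 + sqrt(1 - alpha_bar_t) eps  and
    y   = H x0 + n,  n ~ N(0, sigma_y^2 I),  with eps and n independent.
    """
    d = mu0.shape[0]
    P0 = np.linalg.inv(Sigma0)  # prior precision
    # Precisions add: prior + measurement + noisy diffusion state.
    precision = (P0
                 + H.T @ H / sigma_y**2
                 + (alpha_bar_t / (1.0 - alpha_bar_t)) * np.eye(d))
    # Information vector collects the matching linear terms.
    info = (P0 @ mu0
            + H.T @ y / sigma_y**2
            + (np.sqrt(alpha_bar_t) / (1.0 - alpha_bar_t)) * x_t)
    return np.linalg.solve(precision, info)

# Hypothetical scalar example.
x0_hat = posterior_denoiser(
    x_t=np.array([0.0]), y=np.array([3.0]),
    mu0=np.zeros(1), Sigma0=np.eye(1), H=np.eye(1),
    sigma_y=1.0, alpha_bar_t=0.5,
)
print(x0_hat)
```

As `alpha_bar_t` approaches zero, `x_t` carries no information about `x_0` and the estimate reduces to the measurement-only posterior mean.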

Figures (12)

Figure 1: Spectral recommendations for the weighting coefficients $\boldsymbol{\zeta}$ in DPS for different numbers of diffusion steps $S\in\{5, 30, 50, 70, 100, 120\}$, with variability across realizations.
Figure 2: Comparison between the spectral recommendations (red) and DPS weighting coefficients for $70$ diffusion steps, evaluated for different heuristic values of $\zeta' \in \{0.1, 0.3, 0.5, 0.7, 1.0\}$.
Figure 3: Comparison of the Wasserstein-2 distance for DPS heuristic values $\zeta' \in \{0.1, 0.3, 0.5, 0.7, 1.0\}$, the spectral recommendations applied to DPS (red) and to $\Pi\text{GDM}$ (brown, dotted), and the analytically derived ideal posterior sampler (black), across different numbers of diffusion steps $S \in \{5, 10, 15, 20, 30, 50, 70, 100, 120, 150\}$.
Figure 4: Comparison of spectral recommendations on the FFHQ dataset, with a zoomed-in view of the DPS heuristics. Results are shown for selected diffusion steps $S\in\{30, 70, 100, 150\}$.
Figure 5: Covariance matrix obtained for $d=50$ and $l=0.05$.
...and 7 more figures

Theorems & Definitions (3)

Theorem 4.1
Lemma 4.2
Lemma 4.3
