Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance

Xinyu Peng; Ziyang Zheng; Wenrui Dai; Nuoqian Xiao; Chenglin Li; Junni Zou; Hongkai Xiong

TL;DR

This work reframes zero-shot diffusion-based solvers for inverse problems as using a Gaussian approximation to the intractable denoising posterior with a hand-crafted isotropic covariance, and then optimizes the posterior covariance via maximum likelihood estimation. It introduces plug-and-play strategies for pre-trained models both with and without reverse covariance, together with a scalable transform-domain variance model that captures pixel correlations. The approach unifies Type I (likelihood-score-based) and Type II (proximal-based) guidance under a variational Gaussian posterior, derives fixed-point relations with DDPM reverse variances, and proposes Monte Carlo and transform-domain methods to estimate posterior variances without retraining. Experiments on inpainting, deblurring, and super-resolution show significant improvements without hyperparameter tuning or retraining. The work also discusses the limitations of diagonal covariance and outlines directions for more expressive covariance designs and nonlinear transformation-based variance modeling.

Abstract

Recent diffusion models provide a promising zero-shot solution to noisy linear inverse problems without retraining for specific inverse problems. In this paper, we reveal that recent methods can be uniformly interpreted as employing a Gaussian approximation with hand-crafted isotropic covariance for the intractable denoising posterior to approximate the conditional posterior mean. Inspired by this finding, we propose to improve recent methods by using more principled covariance determined by maximum likelihood estimation. To achieve posterior covariance optimization without retraining, we provide general plug-and-play solutions based on two approaches specifically designed for leveraging pre-trained models with and without reverse covariance. We further propose a scalable method for learning posterior covariance prediction based on representation with orthonormal basis. Experimental results demonstrate that the proposed methods significantly enhance reconstruction performance without requiring hyperparameter tuning.
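To make the maximum-likelihood idea concrete, the following is a minimal sketch on a toy Gaussian model, not the paper's implementation. It assumes a hypothetical diagonal Gaussian prior $\mathbf{x}_0 \sim \mathcal{N}(\mathbf{0}, \mathrm{diag}(\boldsymbol{\lambda}))$ and a noising step $\mathbf{x}_t = \mathbf{x}_0 + \sigma \boldsymbol{\epsilon}$, for which the denoising posterior is known in closed form. For a Gaussian with a fixed mean, the maximum-likelihood diagonal variance is the per-coordinate mean squared residual, so the estimate should approach the analytic posterior variance. All names (`lam`, `sigma`, `ml_diag_variance`) are illustrative assumptions.

```python
import numpy as np

def ml_diag_variance(x0, mean):
    """ML estimate of a diagonal Gaussian variance when the mean is held fixed."""
    return np.mean((x0 - mean) ** 2, axis=0)

rng = np.random.default_rng(0)
lam = np.array([4.0, 1.0, 0.25])  # hypothetical prior variance per coordinate
sigma = 0.8                       # hypothetical noise level at step t
n = 200_000

# Sample clean signals and their noisy versions x_t = x_0 + sigma * eps.
x0 = rng.normal(size=(n, lam.size)) * np.sqrt(lam)
xt = x0 + sigma * rng.normal(size=x0.shape)

# Closed-form denoising posterior for this toy model.
post_mean = lam / (lam + sigma**2) * xt
true_var = lam * sigma**2 / (lam + sigma**2)

# Per-coordinate ML variance versus a single isotropic value.
est_var = ml_diag_variance(x0, post_mean)
iso_var = est_var.mean()

print("analytic posterior variance:", true_var)
print("ML diagonal estimate:       ", est_var)
print("single isotropic variance:  ", iso_var)
```

In this toy setting the isotropic value is a single number that cannot match coordinates with very different posterior variances, which is the gap a per-coordinate estimate is meant to close.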

Paper Structure (32 sections, 8 theorems, 82 equations, 12 figures, 4 tables)

Introduction
Background
Bayesian Framework for Solving Inverse Problems
Diffusion Models and Conditioning
Unified Interpretation of Diffusion-based Solvers to Inverse Problems
Type I Guidance: Approximating the Likelihood Score Function
Type II Guidance: Approximating the Conditional Posterior Mean Using Proximal Solution
Solving Inverse Problems with Optimal Posterior Covariance
Posterior Covariance Optimization
Converting Optimal Reverse Variances
Monte Carlo Estimation of Posterior Variances
Modeling Pixel-Correlations With Latent Variances
Related Work
Experiments
Sanity Check for Converting Reverse Variance
...and 17 more sections

Key Result

Proposition 3.1

The conditional posterior mean equals the posterior mean drifted by a scaled likelihood score function. Formally,
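A minimal numerical sketch of this relation follows. It assumes the variance-exploding convention $\mathbf{x}_t = \mathbf{x}_0 + \sigma_t \boldsymbol{\epsilon}$, under which the drift would take the form $\mathbb{E}[\mathbf{x}_0 \mid \mathbf{x}_t, \mathbf{y}] = \mathbb{E}[\mathbf{x}_0 \mid \mathbf{x}_t] + \sigma_t^2 \nabla_{\mathbf{x}_t} \log p_t(\mathbf{y} \mid \mathbf{x}_t)$; the paper's exact scaling may differ under its own noise schedule. The check uses a hypothetical scalar model with prior $x_0 \sim \mathcal{N}(0, \lambda)$ and measurement $y = a x_0 + \sigma_y n$, where every quantity has a closed form. All function and variable names are illustrative.

```python
import numpy as np

def posterior_mean(xt, lam, sigma):
    """E[x0 | xt] for x0 ~ N(0, lam), xt = x0 + sigma * eps."""
    return lam / (lam + sigma**2) * xt

def likelihood_score(xt, y, a, lam, sigma, sigma_y):
    """d/dxt log p(y | xt), where p(y | xt) = N(a*k*xt, a^2*v + sigma_y^2)."""
    k = lam / (lam + sigma**2)             # posterior mean gain
    v = lam * sigma**2 / (lam + sigma**2)  # posterior variance of x0 given xt
    return a * k * (y - a * k * xt) / (a**2 * v + sigma_y**2)

def conditional_posterior_mean(xt, y, a, lam, sigma, sigma_y):
    """E[x0 | xt, y], computed directly from the joint Gaussian precisions."""
    precision = 1.0 / lam + 1.0 / sigma**2 + a**2 / sigma_y**2
    return (xt / sigma**2 + a * y / sigma_y**2) / precision

# Hypothetical parameter values and observations, chosen only for illustration.
lam, sigma, a, sigma_y = 2.0, 0.7, 1.5, 0.3
xt, y = 0.9, -0.4

lhs = conditional_posterior_mean(xt, y, a, lam, sigma, sigma_y)
rhs = posterior_mean(xt, lam, sigma) + sigma**2 * likelihood_score(
    xt, y, a, lam, sigma, sigma_y
)
print(lhs, rhs)
```

The two values agree up to floating-point error because, in this Gaussian case, $\sigma_t^2$ times the posterior mean gain equals the posterior variance, so the score-based drift coincides with the usual Gaussian conditioning update.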

Figures (12)

Figure 1: Averaged values of $\mathbf{e}$ (black line) and $\mathbf{r}_t^2(\mathbf{x}_t)$ (blue line) on FFHQ (Left) and ImageNet (Right).
Figure 2: Visualization of $\mathbf{e}$ and $\hat{\mathbf{r}}_t^2(\mathbf{x}_t)$ of an example image $\mathbf{x}_0$ at different $t$. Top: $\mathbf{e}$; Bottom: $\hat{\mathbf{r}}_t^2(\mathbf{x}_t)$. The results are averaged over RGB channels for better visualization.
Figure 3: LPIPS comparisons on FFHQ for Type II guidance. For DiffPIR, we report LPIPS under different $\lambda$.
Figure 4: LPIPS comparisons on FFHQ for DPS with heuristic step size $\zeta_t = \zeta / \lVert \mathbf{y} - \mathbf{A}D_t(\mathbf{x}_t)\rVert_2$. For comprehensive comparisons, we report LPIPS under different $\zeta$.
Figure 5: Qualitative results for Table \ref{tab:typeIquant} on the FFHQ dataset. We observe that our methods reconstruct fine details of the image more faithfully than the baselines.
...and 7 more figures

Theorems & Definitions (24)

Proposition 3.1
proof
Proposition 3.2
proof
Proposition 4.1: Fixed-point solutions of variational Gaussian posterior
proof
Theorem 4.2: Fixed-point solutions of DDPM
proof
Lemma 1.1: Tweedie's formula
proof
...and 14 more
