Inverse problem regularization with hierarchical variational autoencoders

Jean Prost; Antoine Houdard; Andrés Almansa; Nicolas Papadakis

Inverse problem regularization with hierarchical variational autoencoders

Jean Prost, Antoine Houdard, Andrés Almansa, Nicolas Papadakis

TL;DR

This work addresses ill-posed inverse problems by regularizing with a hierarchical variational autoencoder (HVAE) prior. It introduces PnP-HVAE, an encoder-guided, joint-posterior optimization that avoids backpropagation through the generator and leverages temperature scaling to control regularization strength across HVAE levels. The method provides convergence guarantees under contractivity assumptions and demonstrates strong restoration performance on both face datasets (via a pre-trained VDVAE) and natural images using a patch-based HVAE (PatchVDVAE). Overall, PnP-HVAE achieves competitive results with state-of-the-art denoiser-based and generative-model-based methods while enabling restoration across image sizes and maintaining fidelity to observations.

Abstract

In this paper, we propose to regularize ill-posed inverse problems using a deep hierarchical variational autoencoder (HVAE) as an image prior. The proposed method synthesizes the advantages of i) denoiser-based Plug \& Play approaches and ii) generative model based approaches to inverse problems. First, we exploit VAE properties to design an efficient algorithm that benefits from convergence guarantees of Plug-and-Play (PnP) methods. Second, our approach is not restricted to specialized datasets and the proposed PnP-HVAE model is able to solve image restoration problems on natural images of any size. Our experiments show that the proposed PnP-HVAE method is competitive with both SOTA denoiser-based PnP approaches, and other SOTA restoration methods based on generative models.

Inverse problem regularization with hierarchical variational autoencoders

TL;DR

Abstract

Paper Structure (42 sections, 9 theorems, 67 equations, 17 figures, 4 tables, 2 algorithms)

This paper contains 42 sections, 9 theorems, 67 equations, 17 figures, 4 tables, 2 algorithms.

Introduction
Contributions and outline
Related works
Plug-and-Play methods
Deep generative models for inverse problems
Background on variational autoencoders
VAE training
Hierarchical variational autoencoders
Temperature scaling
Regularization with HVAE Prior
Tempered hierarchical joint posterior
Encoder approximation of the joint posterior
Alternate optimization with PnP-HVAE
Convergence analysis
Plug-and-Play HVAE
...and 27 more sections

Key Result

Proposition 1

Assume the decoder is defined as in eq:decoder_constant. Denote $\text{HVAE}(\bm{x}, \bm{\tau}) := \mu_{\theta}\left(E_{\bm{\tau}}\left(\bm{x}\right)\right)$. Then the alternate scheme described in Algorithm algo:final writes

Figures (17)

Figure 1: Numerical estimation of the Lipschitz constant of PatchVDVAE reconstruction with different temperatures $\tau$. We present the histogram of ratio values $\frac{||\text{HVAE}(\bf{u}, \tau)-\text{HVAE}(\bf{v}, \tau)||}{||\bf{u}-\bf{v}||}$, where $\bf{u}$ and $\bf{v}$ are natural images corrupted with white Gaussian noise of different standard deviations $\sigma$. For noisy images ($\sigma>0)$, the observed Lipschitz constant is always less than $1$.
Figure 2: Comparison of the convergence of PnP-HVAE algorithm \ref{['algo:final']} with respect to the baseline Adam optimizer, on a deblurring problem. Left (Convergence of the function value): PnP-HVAE converges faster to a minimum of the joint posterior $J_1(\bm{x}_k,\bm{z}_k)$ in \ref{['eq:J1']}. Right (Convergence of iterates $\bm{x}_k$): PnP-HVAE is more stable than Adam.
Figure 3: Visual comparaison of image restoration methods based on deep generative models. We studied $3$ tasks on face images: inpainting (top), deblurring (middle), super-resolution (bottom). Contrary to the optimization of the objective \ref{['eq:J1']} with Adam, our alternate algorithm generates realistic results, on par with ILO daras2021intermediate, while remaining consistant with the observation.
Figure 4: Left: $64\times64$ patches samples from our patchVDVAE model trained on patches from natural images. Right: PatchVDVAE is fully convolutional and it can generate images of higher resolution (here: $128\times128$).
Figure 5: Natural image deblurring
...and 12 more figures

Theorems & Definitions (9)

Proposition 1
Proposition 2: Proof in supplementary
Corollary 1
Proposition 3: Proof in supplementary
Proposition 4
Proposition 5: Algorithm \ref{['algo:hierarchical_latent_reg']} computes the global minimum of $J_2(\bm{x},\bm{z})$ with respect to $\bm{z}$
Proposition 6: Tweedie's formula for HVAEs.
Proposition 7
Corollary 2

Inverse problem regularization with hierarchical variational autoencoders

TL;DR

Abstract

Inverse problem regularization with hierarchical variational autoencoders

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (17)

Theorems & Definitions (9)