Solving Inverse Problems by Joint Posterior Maximization with Autoencoding Prior
Mario González, Andrés Almansa, Pauline Tan
TL;DR
The paper tackles ill-posed imaging inverse problems by leveraging a variational autoencoder (VAE) prior and introducing Joint Posterior Maximization (JPMAP) over the image and latent code. It demonstrates that the joint objective is quasi-bi-convex, enabling an alternating optimization that converges to a stationary point under mild assumptions, while training the VAE with a denoising criterion improves robustness to out-of-distribution inputs. A continuation scheme further stabilizes the optimization and helps recover robust MAP estimates across diverse degradations. Empirical results on denoising, interpolation, deblurring, super-resolution, and compressed sensing show JPMAP achieving higher restoration quality than competing non-convex MAP methods, with insights on encoder initialization and scalability to higher-dimensional data via future generative-model improvements.
Abstract
In this work we address the problem of solving ill-posed inverse problems in imaging where the prior is a variational autoencoder (VAE). Specifically we consider the decoupled case where the prior is trained once and can be reused for many different log-concave degradation models without retraining. Whereas previous MAP-based approaches to this problem lead to highly non-convex optimization algorithms, our approach computes the joint (space-latent) MAP that naturally leads to alternate optimization algorithms and to the use of a stochastic encoder to accelerate computations. The resulting technique (JPMAP) performs Joint Posterior Maximization using an Autoencoding Prior. We show theoretical and experimental evidence that the proposed objective function is quite close to bi-convex. Indeed it satisfies a weak bi-convexity property which is sufficient to guarantee that our optimization scheme converges to a stationary point. We also highlight the importance of correctly training the VAE using a denoising criterion, in order to ensure that the encoder generalizes well to out-of-distribution images, without affecting the quality of the generative model. This simple modification is key to providing robustness to the whole procedure. Finally we show how our joint MAP methodology relates to more common MAP approaches, and we propose a continuation scheme that makes use of our JPMAP algorithm to provide more robust MAP estimates. Experimental results also show the higher quality of the solutions obtained by our JPMAP approach with respect to other non-convex MAP approaches which more often get stuck in spurious local optima.
