Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems
Sojin Lee, Dogyun Park, Inho Kong, Hyunwoo J. Kim
TL;DR
This work tackles the challenge of noisy inverse problems by replacing per-sample optimization with a diffusion-prior, amortized variational inference framework. DAVI learns an implicit posterior mapping ${q_{\phi}}({\mathbf{x}_0}|{\mathbf{y}})$ from measurements to clean data, enabling single-step posterior sampling ${\hat{\mathbf{x}} = {\mathcal{I}}_{\phi}({\mathbf{y}} + h{\mathbf{z}})}$ and generalization to unseen measurements. It introduces an integral KL objective (IKL) and a Perturbed Posterior Bridge (PPB) to stabilize training and improve generalization, along with an alternating optimization scheme involving an implicit score ${s_\psi}$ and a pre-trained diffusion score ${s_\theta}$. Empirically, DAVI achieves state-of-the-art performance on Gaussian deblurring, 4× super-resolution, and box inpainting across FFHQ and ImageNet, delivering improved FID/LPIPS while maintaining competitive PSNR and delivering fast inference (≈0.04 s/image). The approach has practical implications for scalable, real-time inversion in imaging and related domains, with ethical considerations regarding potential privacy implications of restored imagery.
Abstract
Recent studies on inverse problems have proposed posterior samplers that leverage the pre-trained diffusion models as powerful priors. These attempts have paved the way for using diffusion models in a wide range of inverse problems. However, the existing methods entail computationally demanding iterative sampling procedures and optimize a separate solution for each measurement, which leads to limited scalability and lack of generalization capability across unseen samples. To address these limitations, we propose a novel approach, Diffusion prior-based Amortized Variational Inference (DAVI) that solves inverse problems with a diffusion prior from an amortized variational inference perspective. Specifically, instead of separate measurement-wise optimization, our amortized inference learns a function that directly maps measurements to the implicit posterior distributions of corresponding clean data, enabling a single-step posterior sampling even for unseen measurements. Extensive experiments on image restoration tasks, e.g., Gaussian deblur, 4$\times$ super-resolution, and box inpainting with two benchmark datasets, demonstrate our approach's superior performance over strong baselines. Code is available at https://github.com/mlvlab/DAVI.
