Blind Inversion using Latent Diffusion Priors
Weimin Bai, Siyi Chen, Wenzheng Chen, He Sun
TL;DR
This work tackles blind inverse problems, in which the forward operator is unknown, by introducing LatentDEM. LatentDEM embeds latent diffusion priors into a variational EM framework to jointly estimate the hidden signal $\boldsymbol{x}$ and the forward model parameters $\boldsymbol{\phi}$. The E-step performs posterior sampling in latent space with an annealing-consistent data term, while the M-step updates $\boldsymbol{\phi}$ via MAP estimation; a skip-gradient strategy further accelerates training. In 2D, the method performs strongly on blind motion deblurring. In 3D, it extends to non-linear inverse rendering for pose-free sparse-view reconstruction, improving view consistency and novel-view quality. These results show that latent diffusion priors within EM enable robust blind inversion across 2D and 3D tasks, with practical impact on imaging and rendering applications where the forward model is imperfect or unavailable.
Abstract
Diffusion models have emerged as powerful tools for solving inverse problems due to their exceptional ability to model complex prior distributions. However, existing methods predominantly assume a known forward operator (i.e., the non-blind setting), which limits their applicability in practical settings where acquiring such operators is costly. Additionally, many current approaches rely on pixel-space diffusion models, leaving the potential of more powerful latent diffusion models (LDMs) underexplored. In this paper, we introduce LatentDEM, a method that addresses the more challenging blind inverse problems using latent diffusion priors. At the core of our method is solving blind inverse problems within an iterative Expectation-Maximization (EM) framework: (1) the E-step recovers clean images from corrupted observations using LDM priors and the current estimate of the forward model, which is treated as known, and (2) the M-step estimates the forward operator based on the recovered images. Additionally, we propose two novel optimization techniques tailored for LDM priors and EM frameworks, yielding more accurate and efficient blind inversion results. As a general framework, LatentDEM supports both linear and non-linear inverse problems. Beyond common 2D image restoration tasks, it enables new capabilities in non-linear 3D inverse rendering problems. We validate LatentDEM's performance on representative 2D blind deblurring and 3D sparse-view reconstruction tasks, demonstrating that it outperforms prior art.
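To make the alternation between the two steps concrete, the following is a minimal toy sketch and not the LatentDEM implementation. It assumes a 1D signal, a circular blur with five unknown taps, and a simple Gaussian prior with a known mean `mu` standing in for the latent diffusion prior, so the E-step reduces to a closed-form posterior mean instead of diffusion posterior sampling. All function names, the kernel size, and the values of `sigma`, `tau`, and `lam` are illustrative assumptions.

```python
import numpy as np

def circulant(k, n):
    """n x n circular-convolution matrix for kernel k (zero-padded to length n)."""
    k_pad = np.zeros(n)
    k_pad[: len(k)] = k
    return np.stack([np.roll(k_pad, s) for s in range(n)], axis=1)

def objective(y, x, k, mu, sigma, tau, lam):
    """Negative log joint up to constants: data term + signal prior + kernel prior."""
    r = y - circulant(k, len(x)) @ x
    return (r @ r / (2 * sigma**2)
            + (x - mu) @ (x - mu) / (2 * tau**2)
            + lam * k @ k / 2)

def e_step(y, k, mu, sigma, tau):
    """Recover x with the kernel held fixed.
    Assumption: a Gaussian prior N(mu, tau^2 I) replaces the diffusion prior,
    so the posterior mean has a closed form."""
    K = circulant(k, len(y))
    A = K.T @ K / sigma**2 + np.eye(len(y)) / tau**2
    return np.linalg.solve(A, K.T @ y / sigma**2 + mu / tau**2)

def m_step(y, x, m, sigma, lam):
    """MAP update of the m kernel taps with x held fixed (ridge least squares)."""
    X = np.stack([np.roll(x, j) for j in range(m)], axis=1)  # column j = x shifted by j
    A = X.T @ X / sigma**2 + lam * np.eye(m)
    return np.linalg.solve(A, X.T @ y / sigma**2)

def blind_em(y, mu, m=5, sigma=0.05, tau=0.5, lam=1.0, iters=20):
    """Alternate E- and M-steps, recording the joint objective after each iteration."""
    k = np.zeros(m)
    k[0] = 1.0                      # start from an identity (no-blur) kernel
    x = mu.copy()                   # start from the prior mean
    history = [objective(y, x, k, mu, sigma, tau, lam)]
    for _ in range(iters):
        x = e_step(y, k, mu, sigma, tau)
        k = m_step(y, x, m, sigma, lam)
        history.append(objective(y, x, k, mu, sigma, tau, lam))
    return x, k, history

# Synthetic example: blur a signal drawn from the assumed prior, then invert blindly.
rng = np.random.default_rng(0)
n = 64
mu = rng.standard_normal(n)
x_true = mu + 0.5 * rng.standard_normal(n)
k_true = np.array([0.1, 0.2, 0.4, 0.2, 0.1])
y = circulant(k_true, n) @ x_true + 0.05 * rng.standard_normal(n)

x_hat, k_hat, history = blind_em(y, mu)
print("estimated kernel:", np.round(k_hat, 3))
print("objective: first", history[0], "last", history[-1])
```

Because each step here minimizes the same joint objective exactly over one block of variables, the recorded objective cannot increase from one iteration to the next. That guarantee belongs to this simplified alternating-MAP variant only; in the setting the abstract describes, the E-step is carried out with an LDM prior rather than a closed-form Gaussian update.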
