Deep End-to-End Posterior ENergy (DEEPEN) for image recovery
Jyothi Rikhab Chand, Mathews Jacob
TL;DR
DEEPEN introduces an end-to-end learned energy-based model to represent the posterior in MRI image reconstruction, enabling both MAP estimation and posterior sampling. By modeling the prior with a neural energy ${\mathcal{E}}_{\boldsymbol{\theta}}$ and training via maximum likelihood, it yields a negative log-posterior ${\mathcal{L}}_{\boldsymbol{\theta}}({\boldsymbol{x}}) = \frac{1}{2}\|\mathbf{A}{\boldsymbol{x}}-\boldsymbol{b}\|^2 + \mathcal{E}_{\boldsymbol{\theta}}({\boldsymbol{x}}) + \log \tilde{Z}_{\boldsymbol{\theta}}$, and supports Langevin-based posterior sampling with an efficient gradient-based update. The method avoids algorithm unrolling, does not impose contraction constraints, and demonstrates improved MAP reconstruction over prior E2E and PnP approaches, plus faster sampling than diffusion models with substantially fewer parameters. Empirical results on fastMRI data show robust generalization to unseen acquisition settings, competitive reconstruction quality across acceleration factors, and meaningful uncertainty estimates from generated samples. Overall, DEEPEN provides a scalable, memory-efficient pathway to both high-quality image recovery and uncertainty quantification in MRI, with practical implications for adaptive acquisition and clinical decision-making.
Abstract
Current end-to-end (E2E) and plug-and-play (PnP) image reconstruction algorithms approximate the maximum a posteriori (MAP) estimate but cannot offer sampling from the posterior distribution, like diffusion models. By contrast, it is challenging for diffusion models to be trained in an E2E fashion. This paper introduces a Deep End-to-End Posterior ENergy (DEEPEN) framework, which enables MAP estimation as well as sampling. We learn the parameters of the posterior, which is the sum of the data consistency error and the negative log-prior distribution, using maximum likelihood optimization in an E2E fashion. The proposed approach does not require algorithm unrolling, and hence has a smaller computational and memory footprint than current E2E methods, while it does not require contraction constraints typically needed by current PnP methods. Our results demonstrate that DEEPEN offers improved performance than current E2E and PnP models in the MAP setting, while it also offers faster sampling compared to diffusion models. In addition, the learned energy-based model is observed to be more robust to changes in image acquisition settings.
