Table of Contents
Fetching ...

Inference-Time Search using Side Information for Diffusion-based Image Reconstruction

Mahdi Farahbakhsh, Vishnu Teja Kunde, Dileep Kalathil, Krishna Narayanan, Jean-Francois Chamberland

TL;DR

This work tackles ill-posed inverse problems by incorporating side information at inference time using a training-free, modality-agnostic framework. A reward function $r({f x}_0;{f s})$ tilts the unconditional diffusion prior via $p_{0|S}({f x}_0|{f s}) \\propto p_0({f x}_0) \,e^{r({f x}_0;{f s})/\tau}$, and the conditional reverse-score includes a value term $\nabla_{{\bf x}_t} V_t^{\tau}({\bf x}_t;{f s},{\bf y})$ approximated with DPS-style methods to avoid expensive backprop through the denoiser. The authors introduce two inference-time search strategies, Greedy Search (GS) and Recursive Fork-Join Search (RFJS), which use group-based resampling schedules to balance exploration and exploitation in the particle set. Experiments across six inverse-problem tasks (e.g., box inpainting, super-resolution, motion/Gaussian/nonlinear/deblurring) and multiple side-information modalities (images, text) show that the proposed methods consistently outperform strong baselines, including reward-gradient-guided approaches, while maintaining computational efficiency. The results demonstrate that leveraging side information at inference time can substantially improve reconstruction fidelity and semantic alignment in diffusion-based solvers, with practical impact for multimodal and measurement-limited imaging problems.

Abstract

Diffusion models have emerged as powerful priors for solving inverse problems. However, existing approaches typically overlook side information that could significantly improve reconstruction quality, especially in severely ill-posed settings. In this work, we propose a novel inference-time search algorithm that guides the sampling process using the side information in a manner that balances exploration and exploitation. This enables more accurate and reliable reconstructions, providing an alternative to the gradient-based guidance that is prone to reward-hacking artifacts. Our approach can be seamlessly integrated into a wide range of existing diffusion-based image reconstruction pipelines. Through extensive experiments on a number of inverse problems, such as box inpainting, super-resolution, and various deblurring tasks including motion, Gaussian, nonlinear, and blind deblurring, we show that our approach consistently improves the qualitative and quantitative performance of diffusion-based image reconstruction algorithms. We also show the superior performance of our approach with respect to other baselines, including reward gradient-based guidance algorithms. The code is available at \href{https://github.com/mhdfb/sideinfo-search-reconstruction}{this repository}.

Inference-Time Search using Side Information for Diffusion-based Image Reconstruction

TL;DR

This work tackles ill-posed inverse problems by incorporating side information at inference time using a training-free, modality-agnostic framework. A reward function tilts the unconditional diffusion prior via , and the conditional reverse-score includes a value term approximated with DPS-style methods to avoid expensive backprop through the denoiser. The authors introduce two inference-time search strategies, Greedy Search (GS) and Recursive Fork-Join Search (RFJS), which use group-based resampling schedules to balance exploration and exploitation in the particle set. Experiments across six inverse-problem tasks (e.g., box inpainting, super-resolution, motion/Gaussian/nonlinear/deblurring) and multiple side-information modalities (images, text) show that the proposed methods consistently outperform strong baselines, including reward-gradient-guided approaches, while maintaining computational efficiency. The results demonstrate that leveraging side information at inference time can substantially improve reconstruction fidelity and semantic alignment in diffusion-based solvers, with practical impact for multimodal and measurement-limited imaging problems.

Abstract

Diffusion models have emerged as powerful priors for solving inverse problems. However, existing approaches typically overlook side information that could significantly improve reconstruction quality, especially in severely ill-posed settings. In this work, we propose a novel inference-time search algorithm that guides the sampling process using the side information in a manner that balances exploration and exploitation. This enables more accurate and reliable reconstructions, providing an alternative to the gradient-based guidance that is prone to reward-hacking artifacts. Our approach can be seamlessly integrated into a wide range of existing diffusion-based image reconstruction pipelines. Through extensive experiments on a number of inverse problems, such as box inpainting, super-resolution, and various deblurring tasks including motion, Gaussian, nonlinear, and blind deblurring, we show that our approach consistently improves the qualitative and quantitative performance of diffusion-based image reconstruction algorithms. We also show the superior performance of our approach with respect to other baselines, including reward gradient-based guidance algorithms. The code is available at \href{https://github.com/mhdfb/sideinfo-search-reconstruction}{this repository}.

Paper Structure

This paper contains 31 sections, 3 theorems, 27 equations, 20 figures, 5 tables, 1 algorithm.

Key Result

Proposition 1

Let $p_{t\mid t+1, Y, S}$ denote the conditional posterior distribution for the reverse diffusion process. Then using eq:proxy-posterior we have where $V_t^\tau(\mathbf{x}_t; \mathbf{s}, \mathbf{y}) \triangleq \log \mathbb{E}_{\mathbf{x}_0 \sim p_{0\mid t, Y}(\cdot \mid \mathbf{x}_t, \mathbf{y})}[\exp(r(\mathbf{x}_0; \mathbf{s})/\tau)]$.

Figures (20)

  • Figure 1: Illustration of the performance of our inference-time search algorithm for using side information in solving inverse problems, compared with the DPS algorithm chung2023dps.
  • Figure 2: Illustration of the group size resampling strategies of different search algorithms.
  • Figure 3: Image as side information: Qualitative illustration of the performance of our RFJS algorithm compared to the DPS baseline on linear and nonlinear inverse problems. RFJS is able to capture many details that are missed by the DPS baseline to achieve a superior reconstruction quality.
  • Figure 4: Text as side information : Qualitative illustration of the performance of our RFJS algorithm compared to the DPS baseline. For example, the side information provided for the super resolution task is 'golden retriever sitting on a snowy frozen lake, facing forward'. RFJS is able to capture many details that are missed by the DPS baseline to achieve a superior reconstruction quality.
  • Figure 5: Contrast Image as Side Information: Qualitative MRI reconstruction with RFJS vs. ContextMRI. The shapes and line edges are well preserved in our reconstruction.
  • ...and 15 more figures

Theorems & Definitions (4)

  • Proposition 1
  • Lemma 2
  • Proposition 3
  • Remark 4