DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency
Zalan Fabian, Berk Tinaz, Mahdi Soltanolkotabi
TL;DR
Dirac proposes a diffusion-based inverse-problem solver that explicitly models the observed degraded measurement as a stochastic degradation process and then reverses this process to recover the clean signal while preserving data-consistency. The approach jointly learns a score network from degraded pairs, introduces an incremental reconstruction loss to stabilize reverse steps, and utilizes a degradation-scheduling strategy for controlled trade-offs between perceptual quality and distortion metrics. The framework yields state-of-the-art results on high-resolution deblurring and inpainting, with fast sampling and built-in data fidelity, validated on CelebA-HQ and ImageNet against strong baselines. A key limitation is the need to train a dedicated model per inverse problem, but the method offers flexible early-stopping and robust performance across test-time variations.
Abstract
Diffusion models have established new state of the art in a multitude of computer vision tasks, including image restoration. Diffusion-based inverse problem solvers generate reconstructions of exceptional visual quality from heavily corrupted measurements. However, in what is widely known as the perception-distortion trade-off, the price of perceptually appealing reconstructions is often paid in declined distortion metrics, such as PSNR. Distortion metrics measure faithfulness to the observation, a crucial requirement in inverse problems. In this work, we propose a novel framework for inverse problem solving, namely we assume that the observation comes from a stochastic degradation process that gradually degrades and noises the original clean image. We learn to reverse the degradation process in order to recover the clean image. Our technique maintains consistency with the original measurement throughout the reverse process, and allows for great flexibility in trading off perceptual quality for improved distortion metrics and sampling speedup via early-stopping. We demonstrate the efficiency of our method on different high-resolution datasets and inverse problems, achieving great improvements over other state-of-the-art diffusion-based methods with respect to both perceptual and distortion metrics.
