Robust Posterior Diffusion-based Sampling via Adaptive Guidance Scale
Liav Hen, Tom Tirer, Raja Giryes, Shady Abu-Hussein
TL;DR
AdaPS addresses the challenge of balancing the diffusion prior with data fidelity in posterior sampling for inverse problems by introducing a hyperparameter-free, adaptive guidance scale. The method reformulates DDIM for conditional sampling and uses an alignment-based update that combines a Jacobian-free magnitude estimate with a potentially Jacobian-aware direction, via a MAP-based surrogate. This yields robust, perceptually superior reconstructions across tasks like super-resolution and deblurring, achieving best or near-best LPIPS with minimal PSNR loss and demonstrating stability as the number of diffusion steps changes. The approach scales naturally with the diffusion schedule and stochasticity, obviating task-specific hyperparameter tuning and offering a practical, efficient pathway for diffusion priors in real-world inverse problems.
Abstract
Diffusion models have recently emerged as powerful generative priors for solving inverse problems, achieving state-of-the-art results across various imaging tasks. A central challenge in this setting lies in balancing the contribution of the prior with the data fidelity term: overly aggressive likelihood updates may introduce artifacts, while conservative updates can slow convergence or yield suboptimal reconstructions. In this work, we propose an adaptive likelihood step-size strategy to guide the diffusion process for inverse-problem formulations. Specifically, we develop an observation-dependent weighting scheme based on the agreement between two different approximations of the intractable intermediate likelihood gradients, that adapts naturally to the diffusion schedule, time re-spacing, and injected stochasticity. The resulting approach, Adaptive Posterior diffusion Sampling (AdaPS), is hyperparameter-free and improves reconstruction quality across diverse imaging tasks - including super-resolution, Gaussian deblurring, and motion deblurring - on CelebA-HQ and ImageNet-256 validation sets. AdaPS consistently surpasses existing diffusion-based baselines in perceptual quality with minimal or no loss in distortion, without any task-specific tuning. Extensive ablation studies further demonstrate its robustness to the number of diffusion steps, observation noise levels, and varying stochasticity.
