Kernel Density Steering: Inference-Time Scaling via Mode Seeking for Image Restoration
Yuyang Hu, Kangfu Mei, Mojtaba Sahraee-Ardakan, Ulugbek S. Kamilov, Peyman Milanfar, Mauricio Delbracio
TL;DR
Kernel Density Steering (KDS) introduces an inference-time, plug-and-play framework for diffusion-based image restoration that guides an N-particle latent ensemble toward high-density posterior modes using patch-wise KDE gradients. By performing collective mode seeking via a mean-shift-like update, KDS reduces artifacts and improves both distortion and perceptual quality without retraining or degradation-model knowledge. Empirical results across real-world super-resolution and inpainting tasks show consistent gains in PSNR/SSIM and perceptual metrics, along with robustness to hyperparameters. The approach offers a scalable, model-agnostic enhancement to diffusion samplers with practical impact for high-fidelity IR in diverse real-world settings.
Abstract
Diffusion models show promise for image restoration, but existing methods often struggle with inconsistent fidelity and undesirable artifacts. To address this, we introduce Kernel Density Steering (KDS), a novel inference-time framework promoting robust, high-fidelity outputs through explicit local mode-seeking. KDS employs an $N$-particle ensemble of diffusion samples, computing patch-wise kernel density estimation gradients from their collective outputs. These gradients steer patches in each particle towards shared, higher-density regions identified within the ensemble. This collective local mode-seeking mechanism, acting as "collective wisdom", steers samples away from spurious modes prone to artifacts, arising from independent sampling or model imperfections, and towards more robust, high-fidelity structures. This allows us to obtain better quality samples at the expense of higher compute by simultaneously sampling multiple particles. As a plug-and-play framework, KDS requires no retraining or external verifiers, seamlessly integrating with various diffusion samplers. Extensive numerical validations demonstrate KDS substantially improves both quantitative and qualitative performance on challenging real-world super-resolution and image inpainting tasks.
