An Empirical Study of Sampling Hyperparameters in Diffusion-Based Super-Resolution
Yudhistira Arief Wibowo
TL;DR
The paper investigates how conditioning hyperparameters affect diffusion-based single-image super-resolution, focusing on Diffusion Posterior Sampling (DPS) and Manifold Constrained Gradient (MCG) as conditioning schemes. Using a fixed pretrained diffusion prior on FFHQ, it shows that the conditioning step size largely governs performance, with optimal values around 2.0–3.0, while the number of diffusion steps plays a secondary role. DPS consistently yields better overall reconstructions than the baseline and MCG, which improves pixel-level fidelity but can harm perceptual quality. The work highlights the practical importance of tuning conditioning parameters in diffusion-based inverse problems and notes dataset-specific considerations for generalization.
Abstract
Diffusion models have shown strong potential for solving inverse problems such as single-image super-resolution, where a high-resolution image is recovered from a low-resolution observation using a pretrained unconditional prior. Conditioning methods, including Diffusion Posterior Sampling (DPS) and Manifold Constrained Gradient (MCG), can substantially improve reconstruction quality, but they introduce additional hyperparameters that require careful tuning. In this work, we conduct an empirical ablation study on FFHQ super-resolution to identify the dominant factors affecting performance when applying conditioning to pretrained diffusion models, and show that the conditioning step size has a significantly greater impact than the diffusion step count, with step sizes in the range of [2.0, 3.0] yielding the best overall performance in our experiments.
