Traversing Distortion-Perception Tradeoff using a Single Score-Based Generative Model
Yuhan Wang, Suzhi Bi, Ying-Jun Angela Zhang, Xiaojun Yuan
TL;DR
This paper tackles the distortion-perception (DP) tradeoff in inverse problems by introducing a variance-scaled reverse diffusion framework that uses a single pre-trained score-based model to traverse the DP curve. The authors characterize the marginal distributions under the scaled reverse diffusion and prove optimality in the conditional Gaussian case, showing that the end-point mean coincides with the MMSE solution while the end-point covariance scales with the variance factor. For practical sampling, they approximate the conditional score with diffusion posterior sampling (DPS), which gives inference-time control over the DP tradeoff without retraining. Experiments on 2D distributions and FFHQ images show superior DP coverage compared with GAN-based and diffusion-based baselines. The results suggest that a single score network can handle varying measurements and noise levels, offering a flexible approach to general inverse problems such as Gaussian deblurring and strong super-resolution. Overall, the work advances efficient, principled DP navigation in diffusion-based restoration, with potential impact on real-time image enhancement and adaptable denoising frameworks.
Abstract
The distortion-perception (DP) tradeoff reveals a fundamental conflict between distortion metrics (e.g., MSE and PSNR) and perceptual quality. Recent research has increasingly concentrated on evaluating denoising algorithms within the DP framework. However, existing algorithms either prioritize perceptual quality at the expense of distortion, or focus on minimizing MSE for faithful restoration. When the goal shifts or the noisy measurements vary, adapting to a different point on the DP plane requires retraining or even redesigning the model. Inspired by recent advances in solving inverse problems using score-based generative models, we explore the potential of flexibly and optimally traversing DP tradeoffs using a single pre-trained score-based model. Specifically, we introduce a variance-scaled reverse diffusion process and theoretically characterize its marginal distribution. We then prove that the proposed sampling process is an optimal solution to the DP tradeoff for conditional Gaussian distributions. Experimental results on two-dimensional and image datasets illustrate that a single score network can effectively and flexibly traverse the DP tradeoff for general denoising problems.
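To make the tradeoff concrete, the following is a minimal toy sketch of the scalar Gaussian denoising case, not the paper's diffusion sampler. It assumes x ~ N(0, 1), y = x + n with n ~ N(0, sigma2), and an estimator that adds noise with variance beta times the posterior variance to the MMSE estimate E[x|y]. The names `beta`, `sigma2`, `gaussian_dp_point`, and `simulate` are illustrative assumptions, and perceptual quality is measured here by the squared Wasserstein-2 distance between the distributions of the estimate and of x, which is one common choice rather than necessarily the one used in the paper.

```python
import numpy as np


def gaussian_dp_point(beta, sigma2=1.0):
    """Closed-form (MSE, squared W2) for x_hat = E[x|y] + sqrt(beta * v) * z.

    Toy model (an assumption of this sketch): x ~ N(0, 1), y = x + n,
    n ~ N(0, sigma2), z ~ N(0, 1) independent of x and y.
    """
    v = sigma2 / (1.0 + sigma2)        # posterior variance of x given y
    mean_var = 1.0 / (1.0 + sigma2)    # variance of the posterior mean E[x|y]
    mse = (1.0 + beta) * v             # E[(x - x_hat)^2] = v + beta * v
    s = np.sqrt(mean_var + beta * v)   # std of the marginal of x_hat
    w2_sq = (1.0 - s) ** 2             # squared W2 between N(0, 1) and N(0, s^2)
    return float(mse), float(w2_sq)


def simulate(beta, sigma2=1.0, n=200_000, seed=0):
    """Monte Carlo estimate of the MSE and the marginal variance of x_hat."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(n)
    y = x + np.sqrt(sigma2) * rng.standard_normal(n)
    v = sigma2 / (1.0 + sigma2)
    # MMSE estimate plus scaled posterior-level noise.
    x_hat = y / (1.0 + sigma2) + np.sqrt(beta * v) * rng.standard_normal(n)
    return float(np.mean((x - x_hat) ** 2)), float(np.var(x_hat))


if __name__ == "__main__":
    for beta in (0.0, 0.25, 0.5, 1.0):
        mse, w2_sq = gaussian_dp_point(beta)
        print(f"beta={beta:.2f}  MSE={mse:.3f}  W2^2={w2_sq:.4f}")
```

In this toy model, beta = 0 recovers the MMSE estimate (lowest distortion, but the estimate's distribution is narrower than the prior), while beta = 1 corresponds to posterior sampling (the estimate's distribution matches the prior, at twice the MMSE distortion). Intermediate values of beta trace points between these two extremes.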
