TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution
Linwei Dong, Qingnan Fan, Yihong Guo, Zhonghao Wang, Qi Zhang, Jinwei Chen, Yawei Luo, Changqing Zou
TL;DR
TSD-SR addresses Real-ISR by distilling a pre-trained diffusion prior into a fast one-step model that preserves realistic textures. It introduces Target Score Distillation (TSD), which combines Target Score Matching with the diffusion prior to provide stable, HQ-guided gradients, and a Distribution-Aware Sampling Module (DASM) to emphasize early timesteps for detail recovery. The approach yields superior perceptual restoration with substantially faster inference than prior diffusion-based methods, and ablations confirm the effectiveness of both TSM and DASM. This work advances practical Real-ISR by delivering high-quality results suitable for real-world deployment and sets the stage for future efficiency improvements via model compression.
Abstract
Pre-trained text-to-image diffusion models are increasingly applied to real-world image super-resolution (Real-ISR) task. Given the iterative refinement nature of diffusion models, most existing approaches are computationally expensive. While methods such as SinSR and OSEDiff have emerged to condense inference steps via distillation, their performance in image restoration or details recovery is not satisfied. To address this, we propose TSD-SR, a novel distillation framework specifically designed for real-world image super-resolution, aiming to construct an efficient and effective one-step model. We first introduce the Target Score Distillation, which leverages the priors of diffusion models and real image references to achieve more realistic image restoration. Secondly, we propose a Distribution-Aware Sampling Module to make detail-oriented gradients more readily accessible, addressing the challenge of recovering fine details. Extensive experiments demonstrate that our TSD-SR has superior restoration results (most of the metrics perform the best) and the fastest inference speed (e.g. 40 times faster than SeeSR) compared to the past Real-ISR approaches based on pre-trained diffusion priors.
