One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation

Jianze Li; Jiezhang Cao; Yong Guo; Wenbo Li; Yulun Zhang

TL;DR

Diffusion-based Real-ISR methods offer high realism but suffer from expensive multi-step inference. FluxSR is a one-step Real-ISR framework built on FLUX.1-dev. It introduces Flow Trajectory Distillation, which transfers the multi-step SR flow from a large text-to-image (T2I) model while keeping the teacher flow fixed, so that a single-step generator can be distilled offline. The approach adds TV-LPIPS and Attention Diversification Loss to mitigate high-frequency artifacts, and uses a large-model-friendly training strategy that avoids online teacher inference. Empirical results show that FluxSR achieves state-of-the-art performance among one-step methods and competitive realism against multi-step baselines on real-world datasets, at the cost of a higher parameter count and more compute.

Abstract

Diffusion models (DMs) have significantly advanced the development of real-world image super-resolution (Real-ISR), but the computational cost of multi-step diffusion models limits their application. One-step diffusion models generate high-quality images in a single sampling step, greatly reducing computational overhead and inference latency. However, most existing one-step diffusion methods are constrained by the performance of the teacher model: poor teacher performance results in image artifacts. To address this limitation, we propose FluxSR, a novel one-step diffusion Real-ISR technique based on flow matching models. We use the state-of-the-art diffusion model FLUX.1-dev as both the teacher model and the base model. First, we introduce Flow Trajectory Distillation (FTD) to distill a multi-step flow matching model into a one-step Real-ISR model. Second, to improve image realism and address high-frequency artifacts in generated images, we propose TV-LPIPS as a perceptual loss and introduce Attention Diversification Loss (ADL) as a regularization term that reduces token similarity in the transformer, thereby eliminating high-frequency artifacts. Comprehensive experiments demonstrate that our method outperforms existing one-step diffusion-based Real-ISR methods. The code and model will be released at https://github.com/JianzeLi-114/FluxSR.
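The abstract names two ingredients without giving formulas: a total-variation (TV) component in the perceptual loss and a regularizer that penalizes token similarity. The sketch below illustrates only the generic quantities those names suggest, namely the anisotropic total variation of a small image grid and the mean pairwise cosine similarity of a set of token vectors. The function names `total_variation` and `mean_token_cosine_similarity` are hypothetical, and nothing here should be read as the paper's actual definition of TV-LPIPS or ADL.

```python
import math


def total_variation(img):
    """Anisotropic total variation of a 2D grid (list of rows of floats).

    Sums absolute differences between vertically and horizontally
    adjacent pixels; large values indicate strong high-frequency content.
    """
    h, w = len(img), len(img[0])
    tv = 0.0
    for i in range(h):
        for j in range(w):
            if i + 1 < h:
                tv += abs(img[i + 1][j] - img[i][j])
            if j + 1 < w:
                tv += abs(img[i][j + 1] - img[i][j])
    return tv


def mean_token_cosine_similarity(tokens):
    """Mean cosine similarity over all distinct pairs of token vectors.

    Assumption: a diversification-style regularizer would try to lower
    a quantity of this kind; the paper's exact ADL form is not given here.
    """
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(y * y for y in b))
        return dot / (norm_a * norm_b + 1e-12)  # epsilon guards zero vectors

    n = len(tokens)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    return sum(cosine(tokens[i], tokens[j]) for i, j in pairs) / len(pairs)


if __name__ == "__main__":
    flat = [[0.5, 0.5], [0.5, 0.5]]
    checker = [[0.0, 1.0], [1.0, 0.0]]
    print(total_variation(flat), total_variation(checker))
    print(mean_token_cosine_similarity([[1.0, 0.0], [0.0, 1.0]]))
```

In this toy setting a flat image has zero total variation while a checkerboard has a large one, and orthogonal tokens have zero mean similarity while identical tokens approach one. In practice such quantities would be computed on tensors inside a differentiable training framework rather than on Python lists.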
