OSDFace: One-Step Diffusion Model for Face Restoration
Jingkai Wang, Jue Gong, Lin Zhang, Zheng Chen, Xing Liu, Hong Gu, Yutong Liu, Yulun Zhang, Xiaokang Yang
TL;DR
OSDFace introduces a one-step diffusion framework for face restoration that achieves high fidelity with fast inference. It combines a Visual Representation Embedder (VRE), which extracts priors from low-quality faces, with a single denoising step guided by a learnable prompt, enabling efficient high-quality (HQ) reconstruction. An ArcFace-based facial identity loss and GAN guidance further promote identity preservation and distribution alignment with the ground truth. Across synthetic and real-world datasets, OSDFace attains state-of-the-art perceptual and fidelity metrics while reducing computation, illustrating the practical viability of prior-informed one-step diffusion for faces.
Abstract
Diffusion models have demonstrated impressive performance in face restoration. Yet their multi-step inference process remains computationally intensive, limiting their applicability in real-world scenarios. Moreover, existing methods often struggle to generate face images that are harmonious, realistic, and consistent with the subject's identity. In this work, we propose OSDFace, a novel one-step diffusion model for face restoration. Specifically, we introduce a visual representation embedder (VRE) to better capture prior information and understand the input face. In VRE, low-quality faces are processed by a visual tokenizer and subsequently embedded with a vector-quantized dictionary to generate visual prompts. We also incorporate a facial identity loss derived from face recognition to enforce identity consistency, and we employ a generative adversarial network (GAN) as a guidance model to encourage distribution alignment between the restored face and the ground truth. Experimental results demonstrate that OSDFace surpasses current state-of-the-art (SOTA) methods in both visual quality and quantitative metrics, generating high-fidelity, natural face images with high identity consistency. The code and model will be released at https://github.com/jkwang28/OSDFace.
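To make two of the ingredients named in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. It assumes that embedding with a vector-quantized dictionary amounts to a nearest-neighbor lookup in a codebook, and that the identity loss takes the common form of one minus the cosine similarity between face-recognition embeddings. The function names `quantize` and `identity_loss`, the toy 2-D vectors, and both of these forms are illustrative assumptions; the abstract does not specify them.

```python
import math


def quantize(tokens, codebook):
    """Map each token vector to its nearest codebook entry.

    Assumption: nearest neighbor under squared L2 distance, as in
    standard vector quantization. Returns (indices, quantized vectors).
    """
    indices = []
    for token in tokens:
        dists = [
            sum((a - b) ** 2 for a, b in zip(token, entry))
            for entry in codebook
        ]
        indices.append(dists.index(min(dists)))
    return indices, [codebook[i] for i in indices]


def identity_loss(emb_restored, emb_gt):
    """Assumed identity loss: 1 - cosine similarity of two embeddings.

    In practice the embeddings would come from a face-recognition
    network; here they are plain lists of floats.
    """
    dot = sum(a * b for a, b in zip(emb_restored, emb_gt))
    norm_r = math.sqrt(sum(a * a for a in emb_restored))
    norm_g = math.sqrt(sum(b * b for b in emb_gt))
    return 1.0 - dot / (norm_r * norm_g)


# Toy data (hypothetical): three 2-D "visual tokens" and a 3-entry codebook.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]
tokens = [[0.9, 1.2], [-0.1, 0.1], [-0.8, 0.7]]
indices, quantized = quantize(tokens, codebook)

# Identical embeddings give zero loss; orthogonal ones give a loss of one.
loss_same = identity_loss([1.0, 0.0], [1.0, 0.0])
loss_orth = identity_loss([1.0, 0.0], [0.0, 1.0])
```

In this sketch the quantized vectors stand in for the visual prompts that condition the single denoising step. An actual system would operate on learned, high-dimensional features rather than hand-written 2-D vectors.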
