DifFRelight: Diffusion-Based Facial Performance Relighting

Mingming He; Pascal Clausen; Ahmet Levent Taşel; Li Ma; Oliver Pilarski; Wenqi Xian; Laszlo Rikker; Xueming Yu; Ryan Burgert; Ning Yu; Paul Debevec

TL;DR

This work tackles the challenge of relighting free-viewpoint facial performances captured under a single flat lighting setup. It introduces a subject-specific diffusion-based relighting pipeline that uses paired flat-lit and OLAT data, with lighting encoded via Spherical Harmonics, and augments this with scalable dynamic 3D Gaussian Splatting to render novel viewpoints. Key contributions include a subject-specific diffusion model with spatial and global conditioning, a two-stage deformable 3DGS for long sequences, and a unified lighting framework that supports area-light and HDRI environment lighting. The approach delivers photorealistic relighting that preserves identity and fine details (skin, eyes, hair) and demonstrates real-world HDRI relighting, offering a practical pathway for postproduction relighting of flat-lit footage without extensive multi-light capture.
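As a rough illustration of what encoding a light direction with Spherical Harmonics can look like, the sketch below evaluates the standard real SH basis at a unit direction. The choice of bands 0 to 2 (9 coefficients), the function name `sh_basis_order2`, and the coordinate convention are assumptions made for this example; the summary does not state which SH order or layout the authors use.

```python
import math


def sh_basis_order2(direction):
    """Evaluate the 9 real SH basis functions (bands 0-2) at a direction.

    Hypothetical helper for illustration only; the SH order used in the
    paper is not given in this summary.
    """
    x, y, z = direction
    # Normalize so callers may pass any non-zero vector.
    norm = math.sqrt(x * x + y * y + z * z)
    x, y, z = x / norm, y / norm, z / norm
    return [
        0.282095,                        # band 0
        0.488603 * y,                    # band 1
        0.488603 * z,
        0.488603 * x,
        1.092548 * x * y,                # band 2
        1.092548 * y * z,
        0.315392 * (3.0 * z * z - 1.0),
        1.092548 * x * z,
        0.546274 * (x * x - y * y),
    ]


# Example: a light pointing along +z.
light_code = sh_basis_order2((0.0, 0.0, 1.0))
```

A vector like `light_code` could then serve as a compact global lighting condition, in contrast to the spatially aligned flat-lit image input.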

Abstract

We present a novel framework for free-viewpoint facial performance relighting using diffusion-based image-to-image translation. Leveraging a subject-specific dataset containing diverse facial expressions captured under various lighting conditions, including flat-lit and one-light-at-a-time (OLAT) scenarios, we train a diffusion model for precise lighting control, enabling high-fidelity relit facial images from flat-lit inputs. Our framework includes spatially-aligned conditioning of flat-lit captures and random noise, along with integrated lighting information for global control, utilizing prior knowledge from the pre-trained Stable Diffusion model. This model is then applied to dynamic facial performances captured in a consistent flat-lit environment and reconstructed for novel-view synthesis using a scalable dynamic 3D Gaussian Splatting method to maintain quality and consistency in the relit results. In addition, we introduce unified lighting control by integrating a novel area lighting representation with directional lighting, allowing for joint adjustments in light size and direction. We also enable high dynamic range imaging (HDRI) composition using multiple directional lights to produce dynamic sequences under complex lighting conditions. Our evaluations demonstrate the model's efficiency in achieving precise lighting control and generalizing across various facial expressions while preserving detailed features such as skin texture and hair. The model accurately reproduces complex lighting effects like eye reflections, subsurface scattering, self-shadowing, and translucency, advancing photorealism within our framework.
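One common way to compose an HDRI environment from directional lights relies on the linearity of light transport: render the subject once per light direction, scale each result by the radiance the environment map assigns to that direction, and sum. The sketch below illustrates that general idea only. The function names, the equirectangular layout, and the flat list-of-pixels image format are assumptions for this example, and the abstract does not spell out the authors' exact composition procedure.

```python
import math


def latlong_solid_angle(row, height, width):
    """Solid angle covered by one pixel of an equirectangular map.

    Assumes row 0 is the top of the map (polar angle near 0).
    """
    theta = math.pi * (row + 0.5) / height  # polar angle at pixel center
    return math.sin(theta) * (math.pi / height) * (2.0 * math.pi / width)


def compose_hdri(relit_images, light_rgb):
    """Weighted sum of per-light relit images.

    relit_images: list of images, each a list of (r, g, b) pixels, relit
        under a unit white directional light (hypothetical input format).
    light_rgb: one (r, g, b) weight per light, e.g. environment radiance
        multiplied by the solid angle that light stands in for.
    """
    n_pixels = len(relit_images[0])
    out = [[0.0, 0.0, 0.0] for _ in range(n_pixels)]
    for image, rgb in zip(relit_images, light_rgb):
        for p, pixel in enumerate(image):
            for c in range(3):
                out[p][c] += rgb[c] * pixel[c]
    return out


# Example: two one-pixel "images" combined under two lights.
combined = compose_hdri(
    [[(1.0, 0.5, 0.0)], [(0.0, 0.5, 1.0)]],
    [(2.0, 2.0, 2.0), (1.0, 1.0, 1.0)],
)
```

Weighting by solid angle keeps the result independent of the environment map's resolution, since pixels near the poles of an equirectangular map cover less of the sphere than pixels near the equator.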

