Light Up Your Face: A Physically Consistent Dataset and Diffusion Model for Face Fill-Light Enhancement
Jue Gong, Zihan Zhou, Jingkai Wang, Xiaohong Liu, Yulun Zhang, Xiaokang Yang
TL;DR
This work tackles face fill-light enhancement by adding a virtual fill light that preserves the original background. It introduces LYF-160K, a large-scale, physically consistent dataset created with a disk-shaped 6D fill-light parameterization, and a two-stage framework comprising PALP for physics-informed conditioning and FiLitDiff, a one-step diffusion model, for fast, controllable FFE. The approach demonstrates strong perceptual quality and competitive full-reference metrics while better preserving background illumination, outperforming several baselines and ablations. This dataset and framework provide a practical, physically grounded path for controllable portrait illumination editing with potential impact on photography and video applications.
Abstract
Face fill-light enhancement (FFE) brightens underexposed faces by adding virtual fill light while keeping the original scene illumination and background unchanged. Most face relighting methods aim to reshape overall lighting, which can suppress the input illumination or modify the entire scene, leading to foreground-background inconsistency and mismatching practical FFE needs. To support scalable learning, we introduce LightYourFace-160K (LYF-160K), a large-scale paired dataset built with a physically consistent renderer that injects a disk-shaped area fill light controlled by six disentangled factors, producing 160K before-and-after pairs. We first pretrain a physics-aware lighting prompt (PALP) that embeds the 6D parameters into conditioning tokens, using an auxiliary planar-light reconstruction objective. Building on a pretrained diffusion backbone, we then train a fill-light diffusion (FiLitDiff), an efficient one-step model conditioned on physically grounded lighting codes, enabling controllable and high-fidelity fill lighting at low computational cost. Experiments on held-out paired sets demonstrate strong perceptual quality and competitive full-reference metrics, while better preserving background illumination. The dataset and model will be at https://github.com/gobunu/Light-Up-Your-Face.
