ControlFace: Harnessing Facial Parametric Control for Face Rigging
Wooseok Jang, Youngjun Hong, Geonho Cha, Seungryong Kim
TL;DR
ControlFace tackles face rigging by enabling precise, identity-preserving edits driven by explicit 3DMM renderings. It employs a dual-branch U-Network architecture (FaceNet for reference appearance and a denoising U-Net for generation) integrated via augmented self-attention, plus a Control Mixer Module and Reference Control Guidance to tightly couple target and reference controls. Training on facial video data using quadruplets {X_R,X_T,D_R,D_T} eliminates reconstruction bias and leverages rich identity cues from FaceNet. Across qualitative, quantitative, and user studies, ControlFace achieves superior control adherence, identity preservation, and image quality compared to baselines, showcasing practical applicability without per-identity fine-tuning. The approach advances face rigging by combining 3DMM-based renderings, grounded guidance, and efficient conditioning modules for robust real-world use.
Abstract
Manipulation of facial images to meet specific controls such as pose, expression, and lighting, also known as face rigging, is a complex task in computer vision. Existing methods are limited by their reliance on image datasets, which necessitates individual-specific fine-tuning and limits their ability to retain fine-grained identity and semantic details, reducing practical usability. To overcome these limitations, we introduce ControlFace, a novel face rigging method conditioned on 3DMM renderings that enables flexible, high-fidelity control. We employ a dual-branch U-Nets: one, referred to as FaceNet, captures identity and fine details, while the other focuses on generation. To enhance control precision, the control mixer module encodes the correlated features between the target-aligned control and reference-aligned control, and a novel guidance method, reference control guidance, steers the generation process for better control adherence. By training on a facial video dataset, we fully utilize FaceNet's rich representations while ensuring control adherence. Extensive experiments demonstrate ControlFace's superior performance in identity preservation and control precision, highlighting its practicality. Please see the project website: https://cvlab-kaist.github.io/ControlFace/.
