Total-Editing: Head Avatar with Editable Appearance, Motion, and Lighting

Yizhou Zhao; Chunjiang Liu; Haoyu Chen; Bhiksha Raj; Min Xu; Tadas Baltrusaitis; Mitch Rundle; HsiangTao Wu; Kamran Ghasedi

Total-Editing: Head Avatar with Editable Appearance, Motion, and Lighting

Yizhou Zhao, Chunjiang Liu, Haoyu Chen, Bhiksha Raj, Min Xu, Tadas Baltrusaitis, Mitch Rundle, HsiangTao Wu, Kamran Ghasedi

TL;DR

Total-Editing addresses the joint problem of portrait reenactment and relighting by introducing an intrinsically decomposed NeRF with Phong-based shading, lightmaps, and an MLS deformation framework. By disentangling appearance, motion, and lighting and training on a large synthetic dataset plus real video data, the method achieves superior 3D-aware portrait editing with controllable illumination from either a portrait image or an HDR environment map. Key contributions include the intrinsically decomposed NeRF decoder, the MLS-based deformation for smooth spatiotemporal coherence, and a lightmap-based illumination pathway that enables accurate shading during head motion and lighting transfer. The approach yields higher quality and more flexible results than prior methods, enabling applications such as illumination transfer and background changes in animated portraits, with potential impact for AR/VR, social media, and film production.

Abstract

Face reenactment and portrait relighting are essential tasks in portrait editing, yet they are typically addressed independently, without much synergy. Most face reenactment methods prioritize motion control and multiview consistency, while portrait relighting focuses on adjusting shading effects. To take advantage of both geometric consistency and illumination awareness, we introduce Total-Editing, a unified portrait editing framework that enables precise control over appearance, motion, and lighting. Specifically, we design a neural radiance field decoder with intrinsic decomposition capabilities. This allows seamless integration of lighting information from portrait images or HDR environment maps into synthesized portraits. We also incorporate a moving least squares based deformation field to enhance the spatiotemporal coherence of avatar motion and shading effects. With these innovations, our unified framework significantly improves the quality and realism of portrait editing results. Further, the multi-source nature of Total-Editing supports more flexible applications, such as illumination transfer from one portrait to another, or portrait animation with customized backgrounds.

Total-Editing: Head Avatar with Editable Appearance, Motion, and Lighting

TL;DR

Abstract

Total-Editing: Head Avatar with Editable Appearance, Motion, and Lighting

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)