HRAvatar: High-Quality and Relightable Gaussian Head Avatar
Dongbin Zhang, Yunfei Liu, Lijian Lin, Ye Zhu, Kangjie Chen, Minghan Qin, Yu Li, Haoqian Wang
TL;DR
HRAvatar reconstructs head avatars from monocular video by combining 3D Gaussian Splatting with learnable per-point deformation and an end-to-end expression encoder, yielding high-fidelity geometry and expressive motion. Its appearance model decomposes the head into intrinsic properties (albedo, roughness, Fresnel) and applies physically-based shading with environment maps, enabling realistic relighting of avatars captured under unknown lighting. Experiments on INSTA, HDTF, and self-captured data show state-of-the-art quality and real-time relighting (~155 FPS), and ablations confirm the importance of the encoder, the deformation strategy, and the shading terms. The approach remains limited by FLAME priors and by the difficulty of disentangling intrinsic albedo, which points to future work on semantic material guidance and GPU-accelerated inference.
Abstract
Reconstructing animatable and high-quality 3D head avatars from monocular videos, especially with realistic relighting, is a valuable task. However, the limited information from single-view input, combined with the complex head poses and facial movements, makes this challenging. Previous methods achieve real-time performance by combining 3D Gaussian Splatting with a parametric head model, but the resulting head quality suffers from inaccurate face tracking and limited expressiveness of the deformation model. These methods also fail to produce realistic effects under novel lighting conditions. To address these issues, we propose HRAvatar, a 3DGS-based method that reconstructs high-fidelity, relightable 3D head avatars. HRAvatar reduces tracking errors through end-to-end optimization and better captures individual facial deformations using learnable blendshapes and learnable linear blend skinning. Additionally, it decomposes head appearance into several physical properties and incorporates physically-based shading to account for environmental lighting. Extensive experiments demonstrate that HRAvatar not only reconstructs superior-quality heads but also achieves realistic visual effects under varying lighting conditions.
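To make the deformation idea in the abstract concrete, the following is a minimal sketch of the generic blendshape-plus-linear-blend-skinning pipeline, not the authors' implementation. It assumes each point carries its own blendshape displacements and skinning weights (the quantities HRAvatar is described as learning); here they are plain inputs rather than optimized parameters, and all names (`apply_blendshapes`, `linear_blend_skinning`, `deform`) are hypothetical.

```python
from typing import List

Vec3 = List[float]
Mat34 = List[List[float]]  # 3x4 rigid transform [R | t], one per joint


def apply_blendshapes(rest: List[Vec3], basis: List[List[Vec3]],
                      coeffs: List[float]) -> List[Vec3]:
    """Offset each rest-pose point by a weighted sum of per-point displacements.

    basis[k][i] is the displacement of point i under blendshape k.
    """
    out = []
    for i, p in enumerate(rest):
        q = [float(v) for v in p]
        for k, c in enumerate(coeffs):
            for a in range(3):
                q[a] += c * basis[k][i][a]
        out.append(q)
    return out


def transform(m: Mat34, p: Vec3) -> Vec3:
    """Apply a 3x4 transform [R | t] to a 3D point."""
    return [m[r][0] * p[0] + m[r][1] * p[1] + m[r][2] * p[2] + m[r][3]
            for r in range(3)]


def linear_blend_skinning(points: List[Vec3], weights: List[List[float]],
                          joints: List[Mat34]) -> List[Vec3]:
    """Blend the per-joint transformed positions with per-point weights.

    weights[i][j] is the influence of joint j on point i (rows sum to 1).
    """
    out = []
    for p, w in zip(points, weights):
        q = [0.0, 0.0, 0.0]
        for wj, m in zip(w, joints):
            tp = transform(m, p)
            for a in range(3):
                q[a] += wj * tp[a]
        out.append(q)
    return out


def deform(rest, basis, coeffs, weights, joints):
    """Expression offsets first, then pose-driven skinning."""
    return linear_blend_skinning(apply_blendshapes(rest, basis, coeffs),
                                 weights, joints)


# Toy example: two points, one blendshape, two joints (all values made up).
rest = [[0, 0, 0], [1, 0, 0]]
basis = [[[0, 0, 1], [0, 0, 0]]]           # blendshape 0 moves point 0 along z
coeffs = [0.5]                              # expression coefficient
identity = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]]
shift_y = [[1, 0, 0, 0], [0, 1, 0, 1], [0, 0, 1, 0]]  # translate by +1 in y
weights = [[1.0, 0.0], [0.5, 0.5]]
result = deform(rest, basis, coeffs, weights, [identity, shift_y])
print(result)
```

In this toy setup, point 0 follows only the blendshape offset, while point 1 is pulled halfway toward the translated joint. In a learned setting, `basis` and `weights` would be trainable per-point parameters optimized against the input video rather than fixed values.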
