GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians
Shenhan Qian, Tobias Kirschstein, Liam Schoneveld, Davide Davoli, Simon Giebenhain, Matthias Nießner
TL;DR
GaussianAvatars tackles the challenge of producing photorealistic, controllable head avatars from multi-view video by rigging anisotropic 3D Gaussian splats to a FLAME morphable head model and optimizing them end-to-end. The method introduces a binding inheritance mechanism and adaptive density control to preserve controllability while enriching detail, enabling accurate animation under novel expressions and poses. Empirical results on NeRSemble show substantial gains in novel-view rendering and cross-identity reenactment over state-of-the-art baselines, with efficient training and inference. This work advances practical head avatars for immersive media and telepresence and highlights future directions for including hair and lighting flexibility.
Abstract
We introduce GaussianAvatars, a new method to create photorealistic head avatars that are fully controllable in terms of expression, pose, and viewpoint. The core idea is a dynamic 3D representation based on 3D Gaussian splats that are rigged to a parametric morphable face model. This combination facilitates photorealistic rendering while allowing for precise animation control via the underlying parametric model, e.g., through expression transfer from a driving sequence or by manually changing the morphable model parameters. We parameterize each splat by a local coordinate frame of a triangle and optimize for explicit displacement offset to obtain a more accurate geometric representation. During avatar reconstruction, we jointly optimize for the morphable model parameters and Gaussian splat parameters in an end-to-end fashion. We demonstrate the animation capabilities of our photorealistic avatar in several challenging scenarios. For instance, we show reenactments from a driving video, where our method outperforms existing works by a significant margin.
