3D Gaussian Blendshapes for Head Avatar Animation
Shengjie Ma, Yanlin Weng, Tianjia Shao, Kun Zhou
TL;DR
This paper introduces a 3D Gaussian blendshape representation for head avatars that unifies classic mesh-based semantics with a Gaussian splatting renderer. By modeling a neutral base B0, multiple expression blendshapes ΔBk, and a mouth interior set Bm as Gaussians, and by linearly blending these with expression coefficients ψ and applying linear blend skinning with Θ, the approach achieves real-time, high-fidelity avatar animation from monocular video. A key contribution is enforcing semantic consistency between Gaussian blendshapes and their mesh counterparts via an intermediate variable that links Gaussian differences to mesh deformations, which improves generalization to novel expressions. The method outperforms state-of-the-art NeRF- and point-based approaches in quality metrics and runs at up to 370fps with tens of thousands of Gaussians, offering a practical solution for photoreal telepresence and AR/VR applications while acknowledging limitations such as side-view generalization and potential misuse.
Abstract
We introduce 3D Gaussian blendshapes for modeling photorealistic head avatars. Taking a monocular video as input, we learn a base head model of neutral expression, along with a group of expression blendshapes, each of which corresponds to a basis expression in classical parametric face models. Both the neutral model and expression blendshapes are represented as 3D Gaussians, which contain a few properties to depict the avatar appearance. The avatar model of an arbitrary expression can be effectively generated by combining the neutral model and expression blendshapes through linear blending of Gaussians with the expression coefficients. High-fidelity head avatar animations can be synthesized in real time using Gaussian splatting. Compared to state-of-the-art methods, our Gaussian blendshape representation better captures high-frequency details exhibited in input video, and achieves superior rendering performance.
