SAGA: Surface-Aligned Gaussian Avatar
Ronghan Chen, Yang Cong, Jiayue Liu
TL;DR
This work tackles monocular dynamic human reconstruction by introducing SAGA, a two-stage Surface-Aligned Gaussian Avatar. The method first adheres Gaussians to a coarse SMPL mesh to enforce well-defined geometry and then detaches them to capture fine deformations, aided by Gaussian–Mesh alignment regularization and a Walking-on-Mesh strategy to keep triangle bindings accurate. The approach delivers state-of-the-art novel-view and novel-pose synthesis with fast training (~12 minutes) and real-time rendering (60 FPS), and enables direct high-quality mesh extraction from deformable Gaussians learned from monocular videos. By combining mesh regularization with expressive Gaussians and pose-driven colorization, SAGA achieves superior geometric fidelity and rendering realism in challenging monocular settings, marking a significant advance for practical, photorealistic avatar apps.
Abstract
This paper presents a Surface-Aligned Gaussian representation for creating animatable human avatars from monocular videos,aiming at improving the novel view and pose synthesis performance while ensuring fast training and real-time rendering. Recently,3DGS has emerged as a more efficient and expressive alternative to NeRF, and has been used for creating dynamic human avatars. However,when applied to the severely ill-posed task of monocular dynamic reconstruction, the Gaussians tend to overfit the constantly changing regions such as clothes wrinkles or shadows since these regions cannot provide consistent supervision, resulting in noisy geometry and abrupt deformation that typically fail to generalize under novel views and poses.To address these limitations, we present SAGA,i.e.,Surface-Aligned Gaussian Avatar,which aligns the Gaussians with a mesh to enforce well-defined geometry and consistent deformation, thereby improving generalization under novel views and poses. Unlike existing strict alignment methods that suffer from limited expressive power and low realism,SAGA employs a two-stage alignment strategy where the Gaussians are first adhered on while then detached from the mesh, thus facilitating both good geometry and high expressivity. In the Adhered Stage, we improve the flexibility of Adhered-on-Mesh Gaussians by allowing them to flow on the mesh, in contrast to existing methods that rigidly bind Gaussians to fixed location. In the second Detached Stage, we introduce a Gaussian-Mesh Alignment regularization, which allows us to unleash the expressivity by detaching the Gaussians but maintain the geometric alignment by minimizing their location and orientation offsets from the bound triangles. Finally, since the Gaussians may drift outside the bound triangles during optimization, an efficient Walking-on-Mesh strategy is proposed to dynamically update the bound triangles.
