GASP: Gaussian Avatars with Synthetic Priors

Jack Saunders; Charlie Hewitt; Yanan Jian; Marek Kowalski; Tadas Baltrusaitis; Yiye Chen; Darren Cosker; Virginia Estellers; Nicholas Gyde; Vinay P. Namboodiri; Benjamin E Lundell

TL;DR

GASP addresses the challenge of generating photorealistic, animatable avatars from minimal data by training a synthetic prior over Gaussian Avatar parameters using a large synthetic dataset. The method binds per-Gaussian features to a mesh-attached Gaussian representation and employs a three-stage fitting pipeline (inversion, D-finetuning, and Gaussian refinement) to bridge the synthetic-real domain gap and enable 360° rendering. The resulting avatars can be animated and rendered at around 70fps on consumer hardware, stored as compact ~15MB meshes, and support back-of-head reconstruction despite training from frontal views. Across monocular, single-image, and multi-camera evaluations, GASP achieves state-of-the-art or competitive results with reduced artifacts in unseen views, demonstrating practical applicability for VR, video conferencing, and entertainment.

Abstract

Gaussian Splatting has changed the game for real-time photo-realistic rendering. One of the most popular applications of Gaussian Splatting is to create animatable avatars, known as Gaussian Avatars. Recent works have pushed the boundaries of quality and rendering efficiency but suffer from two main limitations. Either they require expensive multi-camera rigs to produce avatars with free-view rendering, or they can be trained with a single camera but only rendered at high quality from this fixed viewpoint. An ideal model would be trained using a short monocular video or image from available hardware, such as a webcam, and rendered from any view. To this end, we propose GASP: Gaussian Avatars with Synthetic Priors. To overcome the limitations of existing datasets, we exploit the pixel-perfect nature of synthetic data to train a Gaussian Avatar prior. By fitting this prior model to a single photo or video and fine-tuning it, we get a high-quality Gaussian Avatar, which supports 360$^\circ$ rendering. Our prior is only required for fitting, not inference, enabling real-time application. Through our method, we obtain high-quality, animatable avatars from limited data, which can be animated and rendered at 70fps on commercial hardware. See our project page (https://microsoft.github.io/GASP/) for results.
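The idea of fitting a prior and then fine-tuning it, which the TL;DR breaks into inversion, D-finetuning, and Gaussian refinement, can be illustrated with a toy sketch. This is not the authors' implementation. Everything in it is an assumption made for illustration: the prior is a hypothetical linear decoder (`W`, `b`) from a latent code `z` to a parameter vector, the loss compares parameters directly instead of rendered images, and the middle stage is read as fine-tuning the prior's weights with the latent code held fixed.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy "prior": a linear decoder from a latent code to avatar parameters.
latent_dim, param_dim = 4, 16
W = rng.normal(size=(param_dim, latent_dim))
b = rng.normal(size=param_dim)

# Toy "real" target: partly explained by the prior, plus a component the prior
# cannot represent (a stand-in for the synthetic-to-real domain gap).
target = W @ rng.normal(size=latent_dim) + b + 0.5 * rng.normal(size=param_dim)

def loss(params):
    """Mean squared error against the target (assumed stand-in for an image loss)."""
    return float(np.mean((params - target) ** 2))

# Stage 1, inversion: keep the decoder frozen and solve for the latent code.
z, *_ = np.linalg.lstsq(W, target - b, rcond=None)
loss_inversion = loss(W @ z + b)

# Stage 2, fine-tuning: keep the latent code frozen and take a few gradient
# steps on the decoder weights. This step size halves the residual each step.
W_ft, b_ft = W.copy(), b.copy()
lr = 0.5 / (1.0 + z @ z)
for _ in range(3):
    residual = W_ft @ z + b_ft - target
    W_ft -= lr * np.outer(residual, z)
    b_ft -= lr * residual
loss_finetune = loss(W_ft @ z + b_ft)

# Stage 3, refinement: drop the decoder and optimise the output parameters directly.
params = W_ft @ z + b_ft
for _ in range(2):
    params -= 0.5 * (params - target)
loss_refine = loss(params)

print(loss_inversion, loss_finetune, loss_refine)
```

In this toy, each stage frees more parameters than the one before it, so the loss can only go down. The final `params` no longer depend on `W_ft` or `z`, which mirrors the abstract's point that the prior is needed for fitting but not at inference.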
