GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration

Rendong Zhang; Alexandra Watkins; Nilanjan Sarkar

TL;DR

This work introduces GSAC, an end-to-end Gaussian Splatting avatar pipeline that converts monocular video into a photorealistic, riggable SMPL-X-based avatar compatible with Unity. It combines a preprocessing stage (SMPL-X/DECA/mmpose-based estimation and missing-hand handling), a GS training stage (Gaussian splats bound to SMPL-X polygons with targeted losses), and a Unity editor for real-time rendering and animation via GPU-based splats. The approach achieves faster preprocessing and competitive visual quality (PSNR/SSIM/LPIPS), runs in real time in Unity (above 60 FPS), and supports VR/AR application development. The authors acknowledge artifacts in unobserved regions and limitations in cloth dynamics. These findings demonstrate a practical, scalable path toward accessible, photorealistic, animatable avatars for immersive VR/AR experiences and interactive training scenarios.

Abstract

Photorealistic avatars have become essential for immersive applications in virtual reality (VR) and augmented reality (AR), enabling lifelike interactions in areas such as training simulations, telemedicine, and virtual collaboration. These avatars bridge the gap between the physical and digital worlds, improving the user experience through realistic human representation. However, existing avatar creation techniques face significant challenges, including high costs, long creation times, and limited utility in virtual applications. Manual methods, such as MetaHuman, require extensive time and expertise, while automatic approaches, such as NeRF-based pipelines, often lack efficiency and detailed facial expression fidelity and cannot be rendered fast enough for real-time applications. Combining several recent techniques, we introduce an end-to-end 3D Gaussian Splatting (3DGS) avatar creation pipeline that uses monocular video input to create a scalable and efficient photorealistic avatar directly compatible with the Unity game engine. Our pipeline incorporates a novel Gaussian splatting technique with customized preprocessing that enables the use of "in the wild" monocular video capture, detailed facial expression reconstruction, and embedding within a fully rigged avatar model. Additionally, we present a Unity-integrated Gaussian Splatting Avatar Editor, offering a user-friendly environment for VR/AR application development. Experimental results validate the effectiveness of our preprocessing pipeline in standardizing custom data for 3DGS training and demonstrate the versatility of Gaussian avatars in Unity, highlighting the scalability and practicality of our approach.
