AvatarPerfect: User-Assisted 3D Gaussian Splatting Avatar Refinement with Automatic Pose Suggestion
Jotaro Sakamiya, I-Chao Shen, Jinsong Zhang, Mustafa Doga Dogan, Takeo Igarashi
TL;DR
AvatarPerfect addresses artifacts in 3D Gaussian Splatting (3DGS) avatars trained from monocular video by enabling user-guided refinement through 2D image editing and automatic pose suggestions. The system renders artifact-laden views with proposed body and camera poses, lets users edit the 2D renderings with background, inpaint, and diffusion-inpaint tools, and then retrains the 3DGS avatar using both the original video and user edits. Key contributions include a next-best-view inspired pose-suggestion mechanism based on Gaussian visibility, a practical 2D editing workflow to repair floating and anomalous color Gaussians, and a retraining scheme that integrates user edits with automatic updates. User studies and crowdsourced evaluations demonstrate that AvatarPerfect yields higher-quality 3DGS avatars under novel poses compared to a baseline editor, highlighting the value of combining intuitive 2D editing with automated pose guidance for 3D avatar refinement in VR/telepresence applications.
Abstract
Creating high-quality 3D avatars using 3D Gaussian Splatting (3DGS) from a monocular video benefits virtual reality and telecommunication applications. However, existing automatic methods exhibit artifacts under novel poses due to limited information in the input video. We propose AvatarPerfect, a novel system that allows users to iteratively refine 3DGS avatars by manually editing the rendered avatar images. In each iteration, our system suggests a new body and camera pose to help users identify and correct artifacts. The edited images are then used to update the current avatar, and our system suggests the next body and camera pose for further refinement. To investigate the effectiveness of AvatarPerfect, we conducted a user study comparing our method to an existing 3DGS editor SuperSplat, which allows direct manipulation of Gaussians without automatic pose suggestions. The results indicate that our system enables users to obtain higher quality refined 3DGS avatars than the existing 3DGS editor.
