Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling

Hanyang Kong; Xingyi Yang; Xinchao Wang

Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling

Hanyang Kong, Xingyi Yang, Xinchao Wang

TL;DR

Efficient Dynamic Gaussian Splatting (EDGS) tackles the heavy computational burden of rendering dynamic scenes from monocular videos by introducing a sparse, time-variant attribute modeling framework. It uses a sparse anchor-grid initialized from COLMAP, with time-invariant Gaussian attributes decoded by tiny MLPs and time-variant attributes filtered through a time-mask MLP, ensuring only deformable regions are processed each frame. Dynamics are modeled sparsely via an RBF kernel that propagates anchor motions to per-Gaussian offsets, enabling precise yet efficient motion representation. Empirical results on NeRF-DS and HyperNeRF show EDGS achieves higher PSNR/SSIM and significantly faster FPS with far fewer Gaussians than state-of-the-art methods, underscoring its practical value for real-time dynamic scene rendering.

Abstract

Rendering dynamic scenes from monocular videos is a crucial yet challenging task. The recent deformable Gaussian Splatting has emerged as a robust solution to represent real-world dynamic scenes. However, it often leads to heavily redundant Gaussians, attempting to fit every training view at various time steps, leading to slower rendering speeds. Additionally, the attributes of Gaussians in static areas are time-invariant, making it unnecessary to model every Gaussian, which can cause jittering in static regions. In practice, the primary bottleneck in rendering speed for dynamic scenes is the number of Gaussians. In response, we introduce Efficient Dynamic Gaussian Splatting (EDGS), which represents dynamic scenes via sparse time-variant attribute modeling. Our approach formulates dynamic scenes using a sparse anchor-grid representation, with the motion flow of dense Gaussians calculated via a classical kernel representation. Furthermore, we propose an unsupervised strategy to efficiently filter out anchors corresponding to static areas. Only anchors associated with deformable objects are input into MLPs to query time-variant attributes. Experiments on two real-world datasets demonstrate that our EDGS significantly improves the rendering speed with superior rendering quality compared to previous state-of-the-art methods.

Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling

TL;DR

Abstract

Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)