4D Gaussian Splatting with Scale-aware Residual Field and Adaptive Optimization for Real-time Rendering of Temporally Complex Dynamic Scenes
Jinbo Yan, Rui Peng, Luyang Tang, Ronggang Wang
TL;DR
SaRO-GS introduces a real-time capable dynamic scene representation by uniting 4D Gaussian primitives with a Scale-aware Residual Field and an Adaptive Optimization schedule. By projecting 4D Gaussians into 3D with time-aware attributes and leveraging a scale-aware residual encoding via hex-planes and MipMap levels, the method captures temporally complex phenomena such as appearance/disappearance while maintaining interactive frame rates. The approach demonstrates state-of-the-art rendering quality and speed on monocular and multi-view datasets, achieving up to 182 FPS at 400×400 resolution and superior reconstruction metrics. This work advances dynamic scene rendering by enabling high-fidelity, temporally rich reconstructions suitable for real-time immersive applications, with practical impact in VR/AR pipelines and live synthesis.
Abstract
Reconstructing dynamic scenes from video sequences is a highly promising task in the multimedia domain. While previous methods have made progress, they often struggle with slow rendering and managing temporal complexities such as significant motion and object appearance/disappearance. In this paper, we propose SaRO-GS as a novel dynamic scene representation capable of achieving real-time rendering while effectively handling temporal complexities in dynamic scenes. To address the issue of slow rendering speed, we adopt a Gaussian primitive-based representation and optimize the Gaussians in 4D space, which facilitates real-time rendering with the assistance of 3D Gaussian Splatting. Additionally, to handle temporally complex dynamic scenes, we introduce a Scale-aware Residual Field. This field considers the size information of each Gaussian primitive while encoding its residual feature and aligns with the self-splitting behavior of Gaussian primitives. Furthermore, we propose an Adaptive Optimization Schedule, which assigns different optimization strategies to Gaussian primitives based on their distinct temporal properties, thereby expediting the reconstruction of dynamic regions. Through evaluations on monocular and multi-view datasets, our method has demonstrated state-of-the-art performance. Please see our project page at https://yjb6.github.io/SaRO-GS.github.io.
