DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction
Kai Xu, Tze Ho Elden Tse, Jizong Peng, Angela Yao
TL;DR
<3-5 sentence high-level summary> DAS3R tackles static-background reconstruction from dynamic, unposed videos by predicting dynamic masks from image pairs and integrating them into a dynamics-aware Gaussian Splatting framework. It operates without camera intrinsics or depth information, leveraging global alignment and a staticness attribute within Gaussian rendering to separate static and dynamic content. The method achieves over 2 dB PSNR gains on DAVIS and Sintel and improves camera pose estimation under challenging dynamics. While robust, it can incur false positives in depth-variant regions, suggesting future refinements with more diverse data and optimization strategies.
Abstract
We propose a novel framework for scene decomposition and static background reconstruction from everyday videos. By integrating the trained motion masks and modeling the static scene as Gaussian splats with dynamics-aware optimization, our method achieves more accurate background reconstruction results than previous works. Our proposed method is termed DAS3R, an abbreviation for Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction. Compared to existing methods, DAS3R is more robust in complex motion scenarios, capable of handling videos where dynamic objects occupy a significant portion of the scene, and does not require camera pose inputs or point cloud data from SLAM-based methods. We compared DAS3R against recent distractor-free approaches on the DAVIS and Sintel datasets; DAS3R demonstrates enhanced performance and robustness with a margin of more than 2 dB in PSNR. The project's webpage can be accessed via \url{https://kai422.github.io/DAS3R/}
