SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video
Jongmin Park, Minh-Quan Viet Bui, Juan Luis Gonzalez Bello, Jaeho Moon, Jihyong Oh, Munchurl Kim
TL;DR
SplineGS tackles real-time dynamic 3D view synthesis from monocular video without COLMAP by introducing Motion-Adaptive Spline (MAS) trajectories for dynamic 3D Gaussians and Motion-Adaptive Control point Pruning (MACP) to adapt complexity to motion. It jointly optimizes camera parameters and 3D Gaussian attributes in a two-stage process, guided by photometric and geometric consistency, and leverages a COLMAP-free setup to achieve state-of-the-art rendering quality with vastly faster rendering speeds. The framework demonstrates strong performance on challenging in-the-wild sequences, outperforming COLMAP-based and COLMAP-free baselines in NVS quality while delivering real-time or near real-time rendering. The combination of MAS, MACP, and joint optimization provides a robust, efficient pathway for dynamic NVS without external camera parameter priors, with practical impact for AR/VR and content creation in unconstrained environments.
Abstract
Synthesizing novel views from in-the-wild monocular videos is challenging due to scene dynamics and the lack of multi-view cues. To address this, we propose SplineGS, a COLMAP-free dynamic 3D Gaussian Splatting (3DGS) framework for high-quality reconstruction and fast rendering from monocular videos. At its core is a novel Motion-Adaptive Spline (MAS) method, which represents continuous dynamic 3D Gaussian trajectories using cubic Hermite splines with a small number of control points. For MAS, we introduce a Motion-Adaptive Control points Pruning (MACP) method to model the deformation of each dynamic 3D Gaussian across varying motions, progressively pruning control points while maintaining dynamic modeling integrity. Additionally, we present a joint optimization strategy for camera parameter estimation and 3D Gaussian attributes, leveraging photometric and geometric consistency. This eliminates the need for Structure-from-Motion preprocessing and enhances SplineGS's robustness in real-world conditions. Experiments show that SplineGS significantly outperforms state-of-the-art methods in novel view synthesis quality for dynamic scenes from monocular videos, achieving thousands times faster rendering speed.
