CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization
Jan Ackermann, Jonas Kulhanek, Shengqu Cai, Haofei Xu, Marc Pollefeys, Gordon Wetzstein, Leonidas Guibas, Songyou Peng
TL;DR
CL-Splats introduces a continual-learning framework for Gaussian Splatting that updates 3D scene reconstructions in a localized, efficient manner using sparse time-sliced views. It detects 2D changes with a DINOv2-based mask, lifts those changes into 3D via majority voting, and performs exact, region-limited optimization within dynamically pruned spheres. The approach yields higher reconstruction quality and faster convergence than state-of-the-art baselines like CL-NeRF and GaussianEditor, while enabling history recovery and batched updates with modest storage. Across synthetic and real-world datasets, CL-Splats demonstrates robust handling of object additions, removals, and movements, making it well-suited for mixed reality, robotics, and embodied AI tasks where up-to-date 3D reconstructions are critical.
Abstract
In dynamic 3D environments, accurately updating scene representations over time is crucial for applications in robotics, mixed reality, and embodied AI. As scenes evolve, efficient methods to incorporate changes are needed to maintain up-to-date, high-quality reconstructions without the computational overhead of re-optimizing the entire scene. This paper introduces CL-Splats, which incrementally updates Gaussian splatting-based 3D representations from sparse scene captures. CL-Splats integrates a robust change-detection module that segments updated and static components within the scene, enabling focused, local optimization that avoids unnecessary re-computation. Moreover, CL-Splats supports storing and recovering previous scene states, facilitating temporal segmentation and new scene-analysis applications. Our extensive experiments demonstrate that CL-Splats achieves efficient updates with improved reconstruction quality over the state-of-the-art. This establishes a robust foundation for future real-time adaptation in 3D scene reconstruction tasks.
