Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction

Weirong Chen; Ganlin Zhang; Felix Wimbauer; Rui Wang; Nikita Araslanov; Andrea Vedaldi; Daniel Cremers

Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction

Weirong Chen, Ganlin Zhang, Felix Wimbauer, Rui Wang, Nikita Araslanov, Andrea Vedaldi, Daniel Cremers

TL;DR

This paper tackles dynamic scene reconstruction from casual videos by decoupling camera motion from object motion using a learnable 3D tracker, enabling traditional bundle adjustment to operate on both static and dynamic points. It introduces BA-Track, a three-stage pipeline: a motion-decoupled 3D tracker, RGB-D bundle adjustment, and a global depth refinement that enforces depth consistency and rigidity. The approach yields improved camera pose accuracy (ATE) and more coherent dense reconstructions across challenging dynamic datasets, while maintaining memory efficiency. The results demonstrate that combining deep priors with classical optimization can robustly handle real-world dynamic scenes, with potential for future joint intrinsic refinement and richer depth models.

Abstract

Traditional SLAM systems, which rely on bundle adjustment, struggle with highly dynamic scenes commonly found in casual videos. Such videos entangle the motion of dynamic elements, undermining the assumption of static environments required by traditional systems. Existing techniques either filter out dynamic elements or model their motion independently. However, the former often results in incomplete reconstructions, whereas the latter can lead to inconsistent motion estimates. Taking a novel approach, this work leverages a 3D point tracker to separate the camera-induced motion from the observed motion of dynamic objects. By considering only the camera-induced component, bundle adjustment can operate reliably on all scene elements as a result. We further ensure depth consistency across video frames with lightweight post-processing based on scale maps. Our framework combines the core of traditional SLAM -- bundle adjustment -- with a robust learning-based 3D tracker front-end. Integrating motion decomposition, bundle adjustment and depth refinement, our unified framework, BA-Track, accurately tracks the camera motion and produces temporally coherent and scale-consistent dense reconstructions, accommodating both static and dynamic elements. Our experiments on challenging datasets reveal significant improvements in camera pose estimation and 3D reconstruction accuracy.

Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction

TL;DR

Abstract

Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)