Look Ma, No Ground Truth! Ground-Truth-Free Tuning of Structure from Motion and Visual SLAM
Alejandro Fontan, Javier Civera, Tobias Fischer, Michael Milford
TL;DR
The paper tackles the challenge of evaluating SfM and Visual SLAM without relying on costly ground-truth data by introducing Ground-Truth-Free Absolute Trajectory Error (GTF ATE). It develops a Jacobian-based sensitivity framework and a sensitivity-sampling procedure that perturb input images with Gaussian noise, enabling end-to-end GT-free evaluation and hyperparameter tuning. Across multiple datasets and pipelines (GLOMAP for SfM and DROID-SLAM for VSLAM), the GT-free metric strongly correlates with standard GT-based ATE, achieving comparable improvements in tuning and demonstrating substantial potential for scalable, self-supervised optimization. This GT-free approach promises to broaden data sources, support online tuning, and drive data-driven advancement in real-world localization systems. The work lays a foundation for self-supervised, online refinement of SLAM systems with real-world applicability beyond curated benchmarks.
Abstract
Evaluation is critical to both developing and tuning Structure from Motion (SfM) and Visual SLAM (VSLAM) systems, but is universally reliant on high-quality geometric ground truth -- a resource that is not only costly and time-intensive but, in many cases, entirely unobtainable. This dependency on ground truth restricts SfM and SLAM applications across diverse environments and limits scalability to real-world scenarios. In this work, we propose a novel ground-truth-free (GTF) evaluation methodology that eliminates the need for geometric ground truth, instead using sensitivity estimation via sampling from both original and noisy versions of input images. Our approach shows strong correlation with traditional ground-truth-based benchmarks and supports GTF hyperparameter tuning. Removing the need for ground truth opens up new opportunities to leverage a much larger number of dataset sources, and for self-supervised and online tuning, with the potential for a data-driven breakthrough analogous to what has occurred in generative AI.
