R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization
Xudong Jiang, Fangjinhua Wang, Silvano Galliani, Christoph Vogel, Marc Pollefeys
TL;DR
R-SCoRe revisits scene coordinate regression for robust, large-scale visual localization. It introduces covisibility graph-based global encoding learning, data augmentation, and a depth-adjusted reprojection loss to promote implicit triangulation, integrated in a GLACE-inspired coarse-to-fine network with a refinement module. The approach achieves state-of-the-art SCR performance on challenging datasets like Aachen Day-Night with a small map (47MB) and up to 10x improvement over previous SCR methods, approaching the accuracy of feature-matching methods while maintaining a fraction of map size. Ablations demonstrate the value of multi-hypotheses testing, covisibility-informed encodings, and depth-aware supervision, supporting practical, scalable localization without 3D ground-truth supervision. The work suggests promising directions for closing the accuracy gap to FM methods through further integration with generative models and model compression.
Abstract
Learning-based visual localization methods that use scene coordinate regression (SCR) offer the advantage of smaller map sizes. However, on datasets with complex illumination changes or image-level ambiguities, it remains a less robust alternative to feature matching methods. This work aims to close the gap. We introduce a covisibility graph-based global encoding learning and data augmentation strategy, along with a depth-adjusted reprojection loss to facilitate implicit triangulation. Additionally, we revisit the network architecture and local feature extraction module. Our method achieves state-of-the-art on challenging large-scale datasets without relying on network ensembles or 3D supervision. On Aachen Day-Night, we are 10$\times$ more accurate than previous SCR methods with similar map sizes and require at least 5$\times$ smaller map sizes than any other SCR method while still delivering superior accuracy. Code is available at: https://github.com/cvg/scrstudio .
