Robust Scene Coordinate Regression via Geometrically-Consistent Global Descriptors
Son Tung Nguyen, Tobias Fischer, Alejandro Fontan, Michael Milford
TL;DR
The paper tackles perceptual aliasing in visual Localization by introducing a neural aggregator that learns geometrically-consistent global descriptors, aligning visual similarity with covisibility structure.A batch-mining-based training scheme and a modified Generalized Contrastive Loss (mGCL) enable training without manual place labels and improve robustness to noisy graphs.The method achieves notable gains over R-Score on challenging benchmarks (e.g., Aachen Day/Night, Hyundai Department Store) while maintaining low memory overhead, narrowing the gap to traditional structure-based methods.Overall, the work demonstrates that jointly optimized local and global descriptors, guided by dual geometric-visual consistency, substantially enhances SCR performance in large-scale, alias-prone environments.
Abstract
Recent learning-based visual localization methods use global descriptors to disambiguate visually similar places, but existing approaches often derive these descriptors from geometric cues alone (e.g., covisibility graphs), limiting their discriminative power and reducing robustness in the presence of noisy geometric constraints. We propose an aggregator module that learns global descriptors consistent with both geometrical structure and visual similarity, ensuring that images are close in descriptor space only when they are visually similar and spatially connected. This corrects erroneous associations caused by unreliable overlap scores. Using a batch-mining strategy based solely on the overlap scores and a modified contrastive loss, our method trains without manual place labels and generalizes across diverse environments. Experiments on challenging benchmarks show substantial localization gains in large-scale environments while preserving computational and memory efficiency. Code is available at \href{https://github.com/sontung/robust\_scr}{github.com/sontung/robust\_scr}.
