A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization
Yejun Zhang, Shuzhe Wang, Juho Kannala
TL;DR
A2-GNN introduces an Angle-Annular Graph Neural Network for visual descriptor-free camera relocalization, leveraging bearing vectors and a local graph with angle-based neighbor grouping to capture robust geometric structure. The architecture combines a feature encoder, angle-annular geometric processing, optimal transport for initialization, and bearing-vector-based outlier rejection, trained with a joint matching and outlier loss. Empirical results on MegaDepth, Cambridge Landmark, and 7Scenes show state-of-the-art performance among descriptor-free methods with substantial efficiency gains over prior descriptor-free approaches. The work highlights the feasibility and practical impact of descriptor-free 2D–3D matching for robust, privacy-preserving localization, while acknowledging remaining gaps to descriptor-based methods and sensitivity to high outlier ratios.
Abstract
Visual localization involves estimating the 6-degree-of-freedom (6-DoF) camera pose within a known scene. A critical step in this process is identifying pixel-to-point correspondences between 2D query images and 3D models. Most advanced approaches currently rely on extensive visual descriptors to establish these correspondences, facing challenges in storage, privacy issues and model maintenance. Direct 2D-3D keypoint matching without visual descriptors is becoming popular as it can overcome those challenges. However, existing descriptor-free methods suffer from low accuracy or heavy computation. Addressing this gap, this paper introduces the Angle-Annular Graph Neural Network (A2-GNN), a simple approach that efficiently learns robust geometric structural representations with annular feature extraction. Specifically, this approach clusters neighbors and embeds each group's distance information and angle as supplementary information to capture local structures. Evaluation on matching and visual localization datasets demonstrates that our approach achieves state-of-the-art accuracy with low computational overhead among visual description-free methods. Our code will be released on https://github.com/YejunZhang/a2-gnn.
