An Efficient Algorithm for Learning-Based Visual Localization

Jindi Zhong; Ziyuan Guo; Hongxia Wang; Huanshui Zhang

An Efficient Algorithm for Learning-Based Visual Localization

Jindi Zhong, Ziyuan Guo, Hongxia Wang, Huanshui Zhang

TL;DR

This work tackles GPS-denied visual localization under tight resource constraints by integrating an Optimal Control Principle (OCP) based optimizer with a diagonal Hessian approximation via Hutchinson's method. The Diag-OCP algorithm combines exponential moving averages of gradients and Hessian diagonals with an adaptive step-size, enabling a lightweight CNN to achieve competitive localization accuracy while maintaining efficiency. The authors prove a non-asymptotic convergence rate of $\mathcal{O}(1/T)$ under standard assumptions and demonstrate strong empirical performance on the KITTI dataset with a CNN containing under 1% of ResNet-18 parameters, highlighting rapid convergence and robust generalization. This approach offers a practical pathway to high-performance offline positioning on edge devices by marrying second-order curvature information with efficient diagonal approximations.

Abstract

This paper addresses the visual localization problem in Global Positioning System (GPS)-denied environments, where computational resources are often limited. To achieve efficient and robust performance under these constraints, we propose a novel algorithm. The algorithm stems from the optimal control principle (OCP). It incorporates diagonal information estimation of the Hessian matrix, which results in training a higher-performance deep neural network and accelerates optimization convergence. Experimental results on public datasets demonstrate that the final model achieves competitive localization accuracy and exhibits remarkable generalization capability. This study provides new insights for developing high-performance offline positioning systems.

An Efficient Algorithm for Learning-Based Visual Localization

TL;DR

Abstract

An Efficient Algorithm for Learning-Based Visual Localization

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (10)