Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU

Yicheng Lin; Yunlong Jiang; Xujia Jiao; Bin Han

Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU

Yicheng Lin, Yunlong Jiang, Xujia Jiao, Bin Han

TL;DR

This work tackles long-term visual localization under appearance changes by proposing a hierarchical framework that blends real-time handcrafted feature tracking with selective, offline-learned keypoints for absolute pose. A unified learning-based feature extraction module enables cross-method compatibility, while a hierarchical pose optimization fuses handcrafted and learned observations within a local map to correct accumulated error on a CPU. The approach demonstrates substantial improvements in global localization accuracy across seasonal changes and maintains practical CPU efficiency, with 47% average error reduction reported in photometric variation scenarios. The results suggest a robust, universally applicable strategy for industrial robotics, paving the way for unified feature representations and more efficient learned-keypoint architectures.

Abstract

Robust long-term visual localization in complex industrial environments is critical for mobile robotic systems. Existing approaches face limitations: handcrafted features are illumination-sensitive, learned features are computationally intensive, and semantic- or marker-based methods are environmentally constrained. Handcrafted and learned features share similar representations but differ functionally. Handcrafted features are optimized for continuous tracking, while learned features excel in wide-baseline matching. Their complementarity calls for integration rather than replacement. Building on this, we propose a hierarchical localization framework. It leverages real-time handcrafted feature extraction for relative pose estimation. In parallel, it employs selective learned keypoint detection on optimized keyframes for absolute positioning. This design enables CPU-efficient, long-term visual localization. Experiments systematically progress through three validation phases: Initially establishing feature complementarity through comparative analysis, followed by computational latency profiling across algorithm stages on CPU platforms. Final evaluation under photometric variations (including seasonal transitions and diurnal cycles) demonstrates 47% average error reduction with significantly improved localization consistency. The code implementation is publicly available at https://github.com/linyicheng1/ORB_SLAM3_localization.

Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU

TL;DR

Abstract

Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)