ConDo: Continual Domain Expansion for Absolute Pose Regression

Zijun Li; Zhipeng Cai; Bochun Yang; Xuelun Shen; Siqi Shen; Xiaoliang Fan; Michael Paulitsch; Cheng Wang

ConDo: Continual Domain Expansion for Absolute Pose Regression

Zijun Li, Zhipeng Cai, Bochun Yang, Xuelun Shen, Siqi Shen, Xiaoliang Fan, Michael Paulitsch, Cheng Wang

TL;DR

ConDo tackles the brittleness of Absolute Pose Regression (APR) under continual environmental changes by leveraging unlabeled inference data collected after deployment. It distills robust cues from scene-agnostic localization methods to supervise APR updates, while keeping computation bounded and avoiding full re-training. The authors create large-scale benchmarks spanning indoor/outdoor scenes and long-term changes to demonstrate consistent, substantial improvements across architectures and data shifts, with up to 25x faster updates than re-training and significant error reductions on challenging scenes. This approach provides a practical path to life-long visual localization systems that remain accurate as environments evolve.

Abstract

Visual localization is a fundamental machine learning problem. Absolute Pose Regression (APR) trains a scene-dependent model to efficiently map an input image to the camera pose in a pre-defined scene. However, many applications have continually changing environments, where inference data at novel poses or scene conditions (weather, geometry) appear after deployment. Training APR on a fixed dataset leads to overfitting, making it fail catastrophically on challenging novel data. This work proposes Continual Domain Expansion (ConDo), which continually collects unlabeled inference data to update the deployed APR. Instead of applying standard unsupervised domain adaptation methods which are ineffective for APR, ConDo effectively learns from unlabeled data by distilling knowledge from scene-agnostic localization methods. By sampling data uniformly from historical and newly collected data, ConDo can effectively expand the generalization domain of APR. Large-scale benchmarks with various scene types are constructed to evaluate models under practical (long-term) data changes. ConDo consistently and significantly outperforms baselines across architectures, scene types, and data changes. On challenging scenes (Fig.1), it reduces the localization error by >7x (14.8m vs 1.7m). Analysis shows the robustness of ConDo against compute budgets, replay buffer sizes and teacher prediction noise. Comparing to model re-training, ConDo achieves similar performance up to 25x faster.

ConDo: Continual Domain Expansion for Absolute Pose Regression

TL;DR

Abstract

ConDo: Continual Domain Expansion for Absolute Pose Regression

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)