CPO: Change Robust Panorama to Point Cloud Localization
Junho Kim, Hojun Jang, Changwoon Choi, Young Min Kim
TL;DR
CPO addresses panorama-to-point-cloud localization under scene changes by combining fast color-histogram matching with robust 2D $M_{2D}$ and 3D $M_{3D}$ score maps. It avoids heavy per-view rendering through cached histograms and nearest-neighbor reuse, exploiting spherical projection to quickly generate millions of synthetic views. Candidate poses are efficiently initialized using histogram-based scoring and refined via a weighted loss $L_\text{sampling}$ that downweights regions likely affected by changes, yielding stable convergence without pose priors. Experiments on OmniScenes, Structured3D, Stanford 2D-3D-S, and Data61/2D3D demonstrate strong change-robust localization, superior to baselines in both changed and unchanged scenes, with practical runtime on a single GPU. The approach offers a lightweight, scalable solution for AR/VR and autonomous navigation where scene changes are common.
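To make the idea of a weighted sampling loss concrete, the following is a minimal NumPy sketch and not the authors' implementation. It assumes an equirectangular panorama with colors in [0, 1], a camera model of the form $R(X - t)$, nearest-pixel lookup instead of differentiable sampling, and that the 2D and 3D weights are combined by a simple product. The function names `project_equirect` and `weighted_sampling_loss` are hypothetical.

```python
import numpy as np

def project_equirect(points_cam, height, width):
    """Map 3D points in the camera frame to equirectangular pixel indices."""
    x, y, z = points_cam[:, 0], points_cam[:, 1], points_cam[:, 2]
    norm = np.maximum(np.linalg.norm(points_cam, axis=1), 1e-12)
    theta = np.arctan2(y, x)                      # azimuth in [-pi, pi]
    phi = np.arccos(np.clip(z / norm, -1.0, 1.0))  # polar angle in [0, pi]
    u = ((theta + np.pi) / (2 * np.pi) * width).astype(int) % width
    v = np.clip((phi / np.pi * height).astype(int), 0, height - 1)
    return v, u

def weighted_sampling_loss(R, t, points, colors, weights_3d, pano, weights_2d):
    """Weighted mean squared color difference between points and sampled pixels.

    Assumption: per-point weight = weights_3d * weights_2d at the projected pixel.
    """
    cam = (points - t) @ R.T                      # transform points into camera frame
    v, u = project_equirect(cam, *pano.shape[:2])
    residual = np.sum((pano[v, u] - colors) ** 2, axis=1)
    w = weights_3d * weights_2d[v, u]
    return float(np.sum(w * residual) / max(np.sum(w), 1e-12))
```

In this sketch, a point whose color no longer matches the panorama (for example, a moved object) contributes nothing to the loss once its weight is set to zero, which is the intuition behind downweighting changed regions. A gradient-based optimizer would need a differentiable sampler in place of the nearest-pixel lookup used here.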
Abstract
We present CPO, a fast and robust algorithm that localizes a 2D panorama with respect to a 3D point cloud of a scene possibly containing changes. To robustly handle scene changes, our approach deviates from conventional feature point matching, and focuses on the spatial context provided by panorama images. Specifically, we propose efficient color histogram generation and subsequent robust localization using score maps. By utilizing the unique equivariance of spherical projections, we propose very fast color histogram generation for a large number of camera poses without explicitly rendering images for all candidate poses. We accumulate the regional consistency of the panorama and point cloud as 2D/3D score maps, and use them to weight the input color values to further increase robustness. The weighted color distribution quickly finds good initial poses and achieves stable convergence for gradient-based optimization. CPO is lightweight and achieves effective localization in all tested scenarios, showing stable performance despite scene changes, repetitive structures, or featureless regions, which are typical challenges for visual localization with perspective cameras. Code is available at https://github.com/82magnolia/panoramic-localization/.
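As an illustration of how regional color consistency might be accumulated into a 2D score map, the sketch below compares per-patch color histograms of a query panorama against a synthetic view of the point cloud. It is a simplified sketch under stated assumptions, not the released code: histogram intersection is used as one common similarity measure, the patch grid and bin count are arbitrary, both images are assumed to have the same size with colors in [0, 1], and the names `color_histogram`, `histogram_intersection`, and `patch_score_map` are hypothetical.

```python
import numpy as np

def color_histogram(pixels, bins=8):
    """Joint RGB histogram of pixels in [0, 1], normalized to sum to 1."""
    idx = np.clip((pixels.reshape(-1, 3) * bins).astype(int), 0, bins - 1)
    flat = idx[:, 0] * bins * bins + idx[:, 1] * bins + idx[:, 2]
    hist = np.bincount(flat, minlength=bins ** 3).astype(float)
    return hist / max(hist.sum(), 1.0)

def histogram_intersection(h1, h2):
    """Similarity in [0, 1] between two normalized histograms."""
    return float(np.minimum(h1, h2).sum())

def patch_score_map(query, rendered, grid=(2, 4), bins=8):
    """Per-patch histogram intersection between two equally sized panoramas."""
    height, width = query.shape[:2]
    ph, pw = height // grid[0], width // grid[1]
    scores = np.zeros(grid)
    for i in range(grid[0]):
        for j in range(grid[1]):
            patch = (slice(i * ph, (i + 1) * ph), slice(j * pw, (j + 1) * pw))
            scores[i, j] = histogram_intersection(
                color_histogram(query[patch], bins),
                color_histogram(rendered[patch], bins),
            )
    return scores
```

Patches whose color distribution agrees between the panorama and the synthetic view receive scores near 1, while patches covering a changed region receive low scores. Such a map could then serve as per-pixel weights during pose refinement.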
