Pole-based Vehicle Localization with Vector Maps: A Camera-LiDAR Comparative Study
Maxime Noizet, Philippe Xu, Philippe Bonnifait
TL;DR
This work tackles urban localization challenges under GNSS unreliability by leveraging pole-like road features referenced in cm-level vector HD maps. It compares camera-based pole detection against LiDAR-based detection and integrates detected poles into an EKF-based localization filter that fuses GNSS/DR with pole measurements and map features. The authors present a real-time, image-based pole detector trained with automatically annotated data and a geometry-based LiDAR pole detector, demonstrating that a multi-camera setup can achieve localization performance comparable to LiDAR on a challenging peri-urban sequence. Key contributions include a pole-based localization framework with map matching via Hungarian assignment, and an empirical evaluation showing that front cameras yield strong cross-track gains while side cameras boost along-track accuracy, with multi-camera fusion closely approaching LiDAR performance. The results support using multi-camera pole detection as a cost-effective alternative or complement to LiDAR for robust map-aided vehicle localization in feature-rich urban environments.
Abstract
For autonomous navigation, accurate localization with respect to a map is needed. In urban environments, infrastructure such as buildings or bridges cause major difficulties to Global Navigation Satellite Systems (GNSS) and, despite advances in inertial navigation, it is necessary to support them with other sources of exteroceptive information. In road environments, many common furniture such as traffic signs, traffic lights and street lights take the form of poles. By georeferencing these features in vector maps, they can be used within a localization filter that includes a detection pipeline and a data association method. Poles, having discriminative vertical structures, can be extracted from 3D geometric information using LiDAR sensors. Alternatively, deep neural networks can be employed to detect them from monocular cameras. The lack of depth information induces challenges in associating camera detections with map features. Yet, multi-camera integration provides a cost-efficient solution. This paper quantitatively evaluates the efficacy of these approaches in terms of localization. It introduces a real-time method for camera-based pole detection using a lightweight neural network trained on automatically annotated images. The proposed methods' efficiency is assessed on a challenging sequence with a vector map. The results highlight the high accuracy of the vision-based approach in open road conditions.
