Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps
Hengyuan Zhang, David Paz, Yuliang Guo, Arun Das, Xinyu Huang, Karsten Haug, Henrik I. Christensen, Liu Ren
TL;DR
The paper tackles the challenge of online HD map perception for autonomous driving under occlusion and long-range planning by introducing Standard Definition (SD) maps as lightweight priors. It presents two integration pathways: rasterized SD maps fused into BEV for perception and graph-based SD maps coupled with a BEV–SGNN pipeline for joint perception and reasoning. Experiments on OpenLane-V2 and OpenLane-V2-OSM demonstrate faster convergence and up to 30% improvement in centerline mAP, with graph-based SD maps offering parameter-efficient gains. The work contributes a public SD-map–augmented dataset and shows SD priors are robust to moderate localization noise, addressing scalability and maintenance challenges of HD maps in real-world online mapping. The perception range is extended in evaluations to $[-50m, 50m]$ from the typical $[-30m, 30m]$, highlighting the method’s effectiveness in longer-range planning scenarios. The study also reveals that SD-map graphs can reduce model size while preserving or enhancing accuracy, suggesting practical benefits for deployment in resource-constrained settings.
Abstract
Autonomous driving for urban and highway driving applications often requires High Definition (HD) maps to generate a navigation plan. Nevertheless, various challenges arise when generating and maintaining HD maps at scale. While recent online mapping methods have started to emerge, their performance especially for longer ranges is limited by heavy occlusion in dynamic environments. With these considerations in mind, our work focuses on leveraging lightweight and scalable priors-Standard Definition (SD) maps-in the development of online vectorized HD map representations. We first examine the integration of prototypical rasterized SD map representations into various online mapping architectures. Furthermore, to identify lightweight strategies, we extend the OpenLane-V2 dataset with OpenStreetMaps and evaluate the benefits of graphical SD map representations. A key finding from designing SD map integration components is that SD map encoders are model agnostic and can be quickly adapted to new architectures that utilize bird's eye view (BEV) encoders. Our results show that making use of SD maps as priors for the online mapping task can significantly speed up convergence and boost the performance of the online centerline perception task by 30% (mAP). Furthermore, we show that the introduction of the SD maps leads to a reduction of the number of parameters in the perception and reasoning task by leveraging SD map graphs while improving the overall performance. Project Page: https://henryzhangzhy.github.io/sdhdmap/.
