PriorMapNet: Enhancing Online Vectorized HD Map Construction with Priors
Rongxuan Wang, Xin Lu, Xiaoyang Liu, Xiaoyi Zou, Tongyi Cao, Ying Li
TL;DR
PriorMapNet addresses unstable matching in online vectorized HD map construction by injecting priors into both the decoder and encoder. It introduces the PPS-Decoder with position and structure priors derived from offline clustering, the PF-Encoder that provides BEV feature priors, and the DMD cross-attention to improve efficiency. The approach achieves state-of-the-art results on nuScenes and Argoverse 2, demonstrates robustness across enlarged BEV ranges, and maintains practical inference speed. Limitations include the absence of semantic priors and reliance on single-frame inputs, pointing to future work on temporal integration and richer priors.
Abstract
Online vectorized High-Definition (HD) map construction is crucial for subsequent prediction and planning tasks in autonomous driving. Following MapTR paradigm, recent works have made noteworthy achievements. However, reference points are randomly initialized in mainstream methods, leading to unstable matching between predictions and ground truth. To address this issue, we introduce PriorMapNet to enhance online vectorized HD map construction with priors. We propose the PPS-Decoder, which provides reference points with position and structure priors. Fitted from the map elements in the dataset, prior reference points lower the learning difficulty and achieve stable matching. Furthermore, we propose the PF-Encoder to enhance the image-to-BEV transformation with BEV feature priors. Besides, we propose the DMD cross-attention, which decouples cross-attention along multi-scale and multi-sample respectively to achieve efficiency. Our proposed PriorMapNet achieves state-of-the-art performance in the online vectorized HD map construction task on nuScenes and Argoverse2 datasets. The code will be released publicly soon.
