Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction

Xiaolu Liu; Ruizi Yang; Song Wang; Wentong Li; Junbo Chen; Jianke Zhu

Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction

Xiaolu Liu, Ruizi Yang, Song Wang, Wentong Li, Junbo Chen, Jianke Zhu

TL;DR

This paper tackles the generalization gap in online HD map vectorization by proposing UIGenMap, an uncertainty-instructed framework that injects explicit PV structure into BEV map decoding. It combines an uncertainty-aware UA-Decoder with probabilistic attention and per-point uncertainty outputs, a UI2DPrompt module that builds PV-based prompts from PV detections, and a lightweight Mimic Query Distillation (MQ-Distillation) to enable real-time inference by substituting PV prompts with mimic queries. Through geo-based partitions on nuScenes and Argoverse2, UIGenMap achieves state-of-the-art gains (e.g., +5.7 mAP region-based on nuScenes, +4.3 mAP city-based on nuScenes; 60.4 mAP region-based on Argoverse2) and demonstrates robust generalization to unfamiliar driving scenes. The approach offers practical impact for robust HD map construction in autonomous driving by enhancing generalization while maintaining real-time performance, and is complemented by open-source code.

Abstract

Reliable high-definition (HD) map construction is crucial for the driving safety of autonomous vehicles. Although recent studies demonstrate improved performance, their generalization capability across unfamiliar driving scenes remains unexplored. To tackle this issue, we propose UIGenMap, an uncertainty-instructed structure injection approach for generalizable HD map vectorization, which concerns the uncertainty resampling in statistical distribution and employs explicit instance features to reduce excessive reliance on training data. Specifically, we introduce the perspective-view (PV) detection branch to obtain explicit structural features, in which the uncertainty-aware decoder is designed to dynamically sample probability distributions considering the difference in scenes. With probabilistic embedding and selection, UI2DPrompt is proposed to construct PV-learnable prompts. These PV prompts are integrated into the map decoder by designed hybrid injection to compensate for neglected instance structures. To ensure real-time inference, a lightweight Mimic Query Distillation is designed to learn from PV prompts, which can serve as an efficient alternative to the flow of PV branches. Extensive experiments on challenging geographically disjoint (geo-based) data splits demonstrate that our UIGenMap achieves superior performance, with +5.7 mAP improvement on the nuScenes dataset. Source code will be available at https://github.com/xiaolul2/UIGenMap.

Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction

TL;DR

Abstract

Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)