Exploring Semi-Supervised Learning for Online Mapping

Adam Lilja; Erik Wallin; Junsheng Fu; Lars Hammarstrand

Exploring Semi-Supervised Learning for Online Mapping

Adam Lilja, Erik Wallin, Junsheng Fu, Lars Hammarstrand

TL;DR

This work adapts the teacher–student semi-supervised learning paradigm to online BEV-based mapping, introducing temporal fusion of teacher pseudo-labels across frames to exploit frame-to-frame consistency. Leveraging limited labeled data and large unlabeled datasets, the approach combines strong augmentations, thresholding, and a multi-frame teacher fusion to substantially improve static-class map predictions while enabling robust generalization to unseen cities. Key findings show up to a 3.5x improvement over label-only training with 10% labeled data, significant gains across multiple online-mapping architectures, and meaningful domain adaptation benefits when incorporating unlabelled target-domain sequences. The study provides a practical SSL blueprint for online mapping that reduces labeling requirements and enhances deployment prospects in diverse urban environments.

Abstract

The ability to generate online maps using only onboard sensory information is crucial for enabling autonomous driving beyond well-mapped areas. Training models for this task -- predicting lane markers, road edges, and pedestrian crossings -- traditionally require extensive labelled data, which is expensive and labour-intensive to obtain. While semi-supervised learning (SSL) has shown promise in other domains, its potential for online mapping remains largely underexplored. In this work, we bridge this gap by demonstrating the effectiveness of SSL methods for online mapping. Furthermore, we introduce a simple yet effective method leveraging the inherent properties of online mapping by fusing the teacher's pseudo-labels from multiple samples, enhancing the reliability of self-supervised training. If 10% of the data has labels, our method to leverage unlabelled data achieves a 3.5x performance boost compared to only using the labelled data. This narrows the gap to a fully supervised model, using all labels, to just 3.5 mIoU. We also show strong generalization to unseen cities. Specifically, in Argoverse 2, when adapting to Pittsburgh, incorporating purely unlabelled target-domain data reduces the performance gap from 5 to 0.5 mIoU. These results highlight the potential of SSL as a powerful tool for solving the online mapping problem, significantly reducing reliance on labelled data.

Exploring Semi-Supervised Learning for Online Mapping

TL;DR

Abstract

Exploring Semi-Supervised Learning for Online Mapping

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)