Semi-Supervised Active Learning for Semantic Segmentation in Unknown Environments Using Informative Path Planning
Julius Rückin, Federico Magistri, Cyrill Stachniss, Marija Popović
TL;DR
This work tackles semantic segmentation for robots operating in unknown environments by marrying sparse human labeling with self-generated pseudo labels through an adaptive, map-guided data-collection strategy. It introduces a probabilistic, multi-layer semantic environment map and an uncertainty-driven frontier planner to collect informative data under budget constraints, followed by semi-supervised retraining that fuses sparse human labels with uncertainty-aware pseudo labels. Key contributions include (i) probabilistic semantic environment mapping, (ii) an adaptive frontier-based planning objective, (iii) a semi-supervised training regime with region-impurity-guided sparse labeling, and (iv) uncertainty-aware pseudo-label generation that outperforms self-supervised baselines while greatly reducing labeling effort. Experimental results on the ISPRS Potsdam dataset show substantial gains over non-targeted labeling and self-supervised methods, with semi-supervised performance approaching fully supervised baselines using only a fraction of human labels, highlighting practical impact for autonomous robotic perception in new environments.
Abstract
Semantic segmentation enables robots to perceive and reason about their environments beyond geometry. Most of such systems build upon deep learning approaches. As autonomous robots are commonly deployed in initially unknown environments, pre-training on static datasets cannot always capture the variety of domains and limits the robot's perception performance during missions. Recently, self-supervised and fully supervised active learning methods emerged to improve a robot's vision. These approaches rely on large in-domain pre-training datasets or require substantial human labelling effort. We propose a planning method for semi-supervised active learning of semantic segmentation that substantially reduces human labelling requirements compared to fully supervised approaches. We leverage an adaptive map-based planner guided towards the frontiers of unexplored space with high model uncertainty collecting training data for human labelling. A key aspect of our approach is to combine the sparse high-quality human labels with pseudo labels automatically extracted from highly certain environment map areas. Experimental results show that our method reaches segmentation performance close to fully supervised approaches with drastically reduced human labelling effort while outperforming self-supervised approaches.
