COARSE: Collaborative Pseudo-Labeling with Coarse Real Labels for Off-Road Semantic Segmentation
Aurelio Noca, Xianmei Lei, Jonathan Becktor, Jeffrey Edlund, Anna Sabel, Patrick Spieler, Curtis Padgett, Alexandre Alahi, Deegan Atha
TL;DR
Off-road semantic segmentation suffers from scarce dense labels and strong domain gaps. COARSE introduces a semi-supervised domain adaptation framework that combines sparse in-domain coarse labels with densely labeled out-of-domain data via a collaborative pseudo-labeling pipeline built on a DINOv2 backbone and two decoders (PixelDecoder and PatchDecoder). The method achieves substantial $mIoU$ gains on Rellis-3D and RUGD (8.4% and 9.7% over coarse-label baselines) and demonstrates applicability in real-world multi-biome driving scenarios, showcasing data-efficient learning and robust generalization. This work reduces labeling costs while leveraging unlabeled and simulated data to enhance off-road perception for autonomous navigation.
Abstract
Autonomous off-road navigation faces challenges due to diverse, unstructured environments, requiring robust perception with both geometric and semantic understanding. However, scarce densely labeled semantic data limits generalization across domains. Simulated data helps, but introduces domain adaptation issues. We propose COARSE, a semi-supervised domain adaptation framework for off-road semantic segmentation, leveraging sparse, coarse in-domain labels and densely labeled out-of-domain data. Using pretrained vision transformers, we bridge domain gaps with complementary pixel-level and patch-level decoders, enhanced by a collaborative pseudo-labeling strategy on unlabeled data. Evaluations on RUGD and Rellis-3D datasets show significant improvements of 9.7\% and 8.4\% respectively, versus only using coarse data. Tests on real-world off-road vehicle data in a multi-biome setting further demonstrate COARSE's applicability.
