WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection
Tsung-Lin Tsou, Tsung-Han Wu, Winston H. Hsu
TL;DR
This work tackles practical weakly-supervised domain adaptation for 3D object detection by formulating the problem as adapting from a labeled source domain $D_s$ to a weakly-labeled target domain $D_t$ and proposing WLST, a three-stage framework that couples a 3D detector with an autolabeler to generate robust pseudo labels from 2D weak labels. A key novelty is the consistency fusion strategy, which jointly exploits geometric consistency and cross-modality cues to select high-quality pseudo labels from both modalities, enabling effective self-training under domain shifts. The approach is demonstrated on three benchmarks (Waymo, nuScenes, KITTI) and consistently outperforms prior unsupervised and weakly-supervised DA methods, significantly closing the gap to the fully supervised Oracle while remaining detector- and autolabeler-agnostic. The work offers a cost-effective path toward deploying robust 3D detectors in real-world, cross-domain autonomous driving scenarios, leveraging weak target annotations and cross-modal supervision.
Abstract
In the field of domain adaptation (DA) on 3D object detection, most of the work is dedicated to unsupervised domain adaptation (UDA). Yet, without any target annotations, the performance gap between the UDA approaches and the fully-supervised approach is still noticeable, which is impractical for real-world applications. On the other hand, weakly-supervised domain adaptation (WDA) is an underexplored yet practical task that only requires few labeling effort on the target domain. To improve the DA performance in a cost-effective way, we propose a general weak labels guided self-training framework, WLST, designed for WDA on 3D object detection. By incorporating autolabeler, which can generate 3D pseudo labels from 2D bounding boxes, into the existing self-training pipeline, our method is able to generate more robust and consistent pseudo labels that would benefit the training process on the target domain. Extensive experiments demonstrate the effectiveness, robustness, and detector-agnosticism of our WLST framework. Notably, it outperforms previous state-of-the-art methods on all evaluation tasks.
