DALI: Domain Adaptive LiDAR Object Detection via Distribution-level and Instance-level Pseudo Label Denoising
Xiaohu Lu, Hayder Radha
TL;DR
This work tackles the problem of domain shift in LiDAR-based 3D object detection by addressing noise in pseudo labels used in unsupervised domain adaptation. It introduces DALI, a two-pronged approach combining Post-training Size Normalization (PTSN) to correct distribution-level size bias via an optimal scale $\hat{s}$, and Pseudo Point Clouds Generation (PPCG) with ray-constrained and constraint-free variants to reduce instance-level misalignment. Across KITTI, Waymo, and nuScenes, and with backbones like SECOND-IoU and PV-RCNN, DALI achieves state-of-the-art performance on multiple cross-domain tasks, demonstrating strong cross-domain and within-domain robustness. The method offers a practical, geometry-driven augmentation to refine pseudo labels without extra target annotations, with future work extending to non-rigid objects and adverse weather conditions.
Abstract
Object detection using LiDAR point clouds relies on a large amount of human-annotated samples when training the underlying detectors' deep neural networks. However, generating 3D bounding box annotation for a large-scale dataset could be costly and time-consuming. Alternatively, unsupervised domain adaptation (UDA) enables a given object detector to operate on a novel new data, with unlabeled training dataset, by transferring the knowledge learned from training labeled \textit{source domain} data to the new unlabeled \textit{target domain}. Pseudo label strategies, which involve training the 3D object detector using target-domain predicted bounding boxes from a pre-trained model, are commonly used in UDA. However, these pseudo labels often introduce noise, impacting performance. In this paper, we introduce the Domain Adaptive LIdar (DALI) object detection framework to address noise at both distribution and instance levels. Firstly, a post-training size normalization (PTSN) strategy is developed to mitigate bias in pseudo label size distribution by identifying an unbiased scale after network training. To address instance-level noise between pseudo labels and corresponding point clouds, two pseudo point clouds generation (PPCG) strategies, ray-constrained and constraint-free, are developed to generate pseudo point clouds for each instance, ensuring the consistency between pseudo labels and pseudo points during training. We demonstrate the effectiveness of our method on the publicly available and popular datasets KITTI, Waymo, and nuScenes. We show that the proposed DALI framework achieves state-of-the-art results and outperforms leading approaches on most of the domain adaptation tasks. Our code is available at \href{https://github.com/xiaohulugo/T-RO2024-DALI}{https://github.com/xiaohulugo/T-RO2024-DALI}.
