Label-Free Model Failure Detection for Lidar-based Point Cloud Segmentation
Daniel Bogdoll, Finn Sartoris, Vincent Geppert, Svetlana Pavlitska, J. Marius Zöllner
TL;DR
This paper tackles the inadequacy of small labeled evaluation sets for revealing failures in lidar-based point cloud segmentation. It introduces a label-free failure-detection framework based on complementary learning: a supervised stream yields semantic motion labels and a self-supervised stream yields predictive motion labels, with per-point discrepancies used to flag potential failures. The authors also present LidarCODA, the first public real-world lidar anomaly dataset with point-wise 3D labels, enabling quantitative analysis of failure sensitivity to outliers. Experiments on regular and anomalous scenarios show the approach uncovers failures beyond what small validation sets reveal, supporting better deployment and data-collection strategies for large-scale autonomous driving systems.
Abstract
Autonomous vehicles drive millions of miles on the road each year. Under such circumstances, deployed machine learning models are prone to failure both in seemingly normal situations and in the presence of outliers. However, in the training phase, they are only evaluated on small validation and test sets, which are unable to reveal model failures due to their limited scenario coverage. While it is difficult and expensive to acquire large and representative labeled datasets for evaluation, large-scale unlabeled datasets are typically available. In this work, we introduce label-free model failure detection for lidar-based point cloud segmentation, taking advantage of the abundance of unlabeled data available. We leverage different data characteristics by training a supervised and self-supervised stream for the same task to detect failure modes. We perform a large-scale qualitative analysis and present LidarCODA, the first publicly available dataset with labeled anomalies in real-world lidar data, for an extensive quantitative analysis.
