Accurate Training Data for Occupancy Map Prediction in Automated Driving Using Evidence Theory
Jonas Kälble, Sascha Wirges, Maxim Tatarchenko, Eddy Ilg
TL;DR
This work targets the quality of ground-truth data for training occupancy map prediction in automated driving, identifying substantial weaknesses in LIDAR-derived GT used by current benchmarks. It proposes an evidential occupancy mapping pipeline that converts LIDAR measurements into a 3D grid of belief masses over occupied, free, and uncertain voxels using spherical mappings, multi-frame aggregation, and Dempster-Shafer theory. The approach yields significantly more accurate occupancy reconstructions (MAE improvements of $30\%-52\%$ on nuScenes and $53\%$ on Waymo) and provides meaningful per-voxel uncertainty, which is leveraged with an observability-based loss weighting to improve state-of-the-art occupancy prediction by about $25\%$ in MAE. This evidential GT data, along with the uncertainty-aware training, enhances safety-critical perception tasks and motivates future integration of semantic information.
Abstract
Automated driving fundamentally requires knowledge about the surrounding geometry of the scene. Modern approaches use only captured images to predict occupancy maps that represent the geometry. Training these approaches requires accurate data that may be acquired with the help of LiDAR scanners. We show that the techniques used for current benchmarks and training datasets to convert LiDAR scans into occupancy grid maps yield very low quality, and subsequently present a novel approach using evidence theory that yields more accurate reconstructions. We demonstrate that these are superior by a large margin, both qualitatively and quantitatively, and that we additionally obtain meaningful uncertainty estimates. When converting the occupancy maps back to depth estimates and comparing them with the raw LiDAR measurements, our method yields a MAE improvement of 30% to 52% on nuScenes and 53% on Waymo over other occupancy ground-truth data. Finally, we use the improved occupancy maps to train a state-of-the-art occupancy prediction method and demonstrate that it improves the MAE by 25% on nuScenes.
