Addressing Data Annotation Challenges in Multiple Sensors: A Solution for Scania Collected Datasets
Ajinkya Khoche, Aron Asefaw, Alejandro Gonzalez, Bogdan Timus, Sina Sharif Mansouri, Patric Jensfelt
TL;DR
The paper tackles data annotation challenges in multi-sensor autonomous-vehicle datasets for heavy vehicles by modeling non-ego object motion with a planar 2D state $ \bm{x}= [d,s]^T $ along a trajectory of headings $\bm{\Theta}$ and using constant-acceleration dynamics. A Moving Horizon Estimation (MHE) framework processes a track of noisy human annotations to produce robust speed estimates $s^\ast$ over a horizon $N_e$, which are then used to speed-compensate sensor points and refine 3D bounding boxes. The refinement pipeline clusters views by heading, generates $G$ pseudo bounding boxes, and assigns them to speed-compensated clusters, enabling better coverage of dynamic objects across sensors. Validation on real Scania truck/bus data shows smoother speed trajectories and improved annotation completeness, with potential to enhance ground-truth quality for downstream perception and evaluation tasks, and the approach can be extended to additional object classes and longer sequences.
Abstract
Data annotation in autonomous vehicles is a critical step in the development of Deep Neural Network (DNN) based models or the performance evaluation of the perception system. This often takes the form of adding 3D bounding boxes on time-sequential and registered series of point-sets captured from active sensors like Light Detection and Ranging (LiDAR) and Radio Detection and Ranging (RADAR). When annotating multiple active sensors, there is a need to motion compensate and translate the points to a consistent coordinate frame and timestamp respectively. However, highly dynamic objects pose a unique challenge, as they can appear at different timestamps in each sensor's data. Without knowing the speed of the objects, their position appears to be different in different sensor outputs. Thus, even after motion compensation, highly dynamic objects are not matched from multiple sensors in the same frame, and human annotators struggle to add unique bounding boxes that capture all objects. This article focuses on addressing this challenge, primarily within the context of Scania collected datasets. The proposed solution takes a track of an annotated object as input and uses the Moving Horizon Estimation (MHE) to robustly estimate its speed. The estimated speed profile is utilized to correct the position of the annotated box and add boxes to object clusters missed by the original annotation.
