Uncertainty-Aware AB3DMOT by Variational 3D Object Detection
Illia Oleksiienko, Alexandros Iosifidis
TL;DR
The paper addresses robust 3D object detection and tracking under uncertainty for autonomous driving. It introduces a Variational TANet detector that outputs mean predictions together with their uncertainty, and feeds these detections to an uncertainty-aware AB3DMOT tracker: the predicted uncertainty matrix $\Sigma$ is linearly transformed as $\hat{\Sigma} = \alpha I + \beta \Sigma$ and used as the Kalman filter measurement noise. Two uncertainty pathways are presented, internal variance and external covariance, and external covariance yields the stronger tracking gains. Initializing the variational detector from a pretrained TANet (IVTANet) gives the best-performing models. Experiments on the KITTI tracking dataset show improved MOTA and F1 scores, supporting the practical value of uncertainty-aware 3D MOT and of transferring existing models to variational variants.
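The linear transformation $\hat{\Sigma} = \alpha I + \beta \Sigma$ and its role as measurement noise can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `measurement_noise` and `kalman_update` are hypothetical, the update is the textbook Kalman measurement step, and the values of $\alpha$ and $\beta$ used below are arbitrary placeholders rather than values from the paper.

```python
import numpy as np


def measurement_noise(sigma, alpha, beta):
    """Return alpha * I + beta * sigma for a square uncertainty matrix sigma.

    alpha and beta are free hyperparameters here (assumed, not from the paper).
    """
    sigma = np.asarray(sigma, dtype=float)
    return alpha * np.eye(sigma.shape[0]) + beta * sigma


def kalman_update(x, P, z, H, R):
    """Textbook Kalman filter measurement update.

    x: prior state mean, P: prior state covariance,
    z: measurement, H: measurement matrix, R: measurement noise covariance.
    """
    y = z - H @ x                        # innovation
    S = H @ P @ H.T + R                  # innovation covariance
    K = P @ H.T @ np.linalg.inv(S)       # Kalman gain
    x_new = x + K @ y                    # corrected state mean
    P_new = (np.eye(P.shape[0]) - K @ H) @ P  # corrected state covariance
    return x_new, P_new
```

As a one-dimensional check with prior $x = 0$, $P = 1$, measurement $z = 2$, and $\alpha = \beta = 1$: a detection with $\Sigma = 0$ gives $\hat{\Sigma} = 1$ and an updated state of 1.0, while a detection with $\Sigma = 3$ gives $\hat{\Sigma} = 4$ and an updated state of 0.4. The more uncertain detection pulls the track less, which is the intended effect of using the detector's uncertainty as measurement noise.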
Abstract
Autonomous driving needs to rely on high-quality 3D object detection to ensure safe navigation in the world. Uncertainty estimation is an effective tool to provide statistically accurate predictions, while the associated detection uncertainty can be used to implement a safer navigation protocol or to include the user in the loop. In this paper, we propose a Variational Neural Network-based TANet 3D object detector to generate 3D object detections with uncertainty, and we introduce these detections to an uncertainty-aware AB3DMOT tracker. This is done by applying a linear transformation to the estimated uncertainty matrix, which is subsequently used as the measurement noise of the adopted Kalman filter. We implement two ways to estimate the output uncertainty: internally, by computing the variance of the CNN outputs and then propagating the uncertainty through the post-processing, and externally, by associating the final predictions of different samples and computing the covariance of each predicted box. In experiments, we show that the external uncertainty estimation leads to better results, outperforming both internal uncertainty estimation and classical tracking approaches. Furthermore, we propose a method to initialize the Variational 3D object detector with a pretrained TANet model, which leads to the best-performing models.
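The external pathway described above, computing the covariance of each predicted box across samples, can be sketched as follows. This sketch assumes the association step has already been done, so that each row of the input is one sampled prediction of the same object. The function name `external_box_statistics`, the flat box-vector layout, and the use of the unbiased sample covariance are illustrative assumptions, not details taken from the paper.

```python
import numpy as np


def external_box_statistics(boxes):
    """Mean and covariance of box vectors that were already associated
    to the same object across several stochastic forward passes.

    boxes: array-like of shape (num_samples, box_dim), num_samples >= 2.
    Returns (mean of shape (box_dim,), covariance of shape (box_dim, box_dim)).
    """
    boxes = np.asarray(boxes, dtype=float)
    mean = boxes.mean(axis=0)
    # rowvar=False: each column is one box parameter, each row one sample.
    # np.cov uses the unbiased (N - 1) estimator by default (an assumption here).
    cov = np.atleast_2d(np.cov(boxes, rowvar=False))
    return mean, cov
```

The resulting covariance matrix plays the role of the estimated uncertainty matrix that is linearly transformed before being passed to the Kalman filter. A real implementation would also need to handle the orientation angle carefully, since averaging angles naively breaks near the wrap-around point; that detail is omitted here.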
