Table of Contents
Fetching ...

Uncertainty-aware Short-term Motion Prediction of Traffic Actors for Autonomous Driving

Nemanja Djuric, Vladan Radosavljevic, Henggang Cui, Thi Nguyen, Fang-Chieh Chou, Tsung-Han Lin, Nitin Singh, Jeff Schneider

TL;DR

The paper tackles short-term motion prediction for traffic actors around autonomous vehicles by rasterizing actor-centric scenes with high-definition map context and feeding them to CNNs to predict future positions and associated uncertainties. It introduces an uncertainty-aware loss based on a half-normal model to capture aleatoric uncertainty and optionally uses an LSTM decoder for improved temporal modeling. The approach integrates dynamic state and map information, achieves strong empirical performance across multiple CNN backbones, and demonstrates well-calibrated uncertainty via reliability analyses and case studies. Extensive real-world data and onboard testing validate its practical potential for safer, more reliable SDV planning.

Abstract

We address one of the crucial aspects necessary for safe and efficient operations of autonomous vehicles, namely predicting future state of traffic actors in the autonomous vehicle's surroundings. We introduce a deep learning-based approach that takes into account a current world state and produces raster images of each actor's vicinity. The rasters are then used as inputs to deep convolutional models to infer future movement of actors while also accounting for and capturing inherent uncertainty of the prediction task. Extensive experiments on real-world data strongly suggest benefits of the proposed approach. Moreover, following completion of the offline tests the system was successfully tested onboard self-driving vehicles.

Uncertainty-aware Short-term Motion Prediction of Traffic Actors for Autonomous Driving

TL;DR

The paper tackles short-term motion prediction for traffic actors around autonomous vehicles by rasterizing actor-centric scenes with high-definition map context and feeding them to CNNs to predict future positions and associated uncertainties. It introduces an uncertainty-aware loss based on a half-normal model to capture aleatoric uncertainty and optionally uses an LSTM decoder for improved temporal modeling. The approach integrates dynamic state and map information, achieves strong empirical performance across multiple CNN backbones, and demonstrates well-calibrated uncertainty via reliability analyses and case studies. Extensive real-world data and onboard testing validate its practical potential for safer, more reliable SDV planning.

Abstract

We address one of the crucial aspects necessary for safe and efficient operations of autonomous vehicles, namely predicting future state of traffic actors in the autonomous vehicle's surroundings. We introduce a deep learning-based approach that takes into account a current world state and produces raster images of each actor's vicinity. The rasters are then used as inputs to deep convolutional models to infer future movement of actors while also accounting for and capturing inherent uncertainty of the prediction task. Extensive experiments on real-world data strongly suggest benefits of the proposed approach. Moreover, following completion of the offline tests the system was successfully tested onboard self-driving vehicles.

Paper Structure

This paper contains 13 sections, 5 equations, 7 figures, 1 table.

Figures (7)

  • Figure 1: Complex intersection scene handled by our model; (a) scene in a 3D viewer, with lane boundaries, surrounding actors, and actor of interest (indicated in yellow); (b) rasterized surroundings of the actor of interest (colored red) in bird's-eye view used as an input to CNN; (c) raster with overlaid ground-truth (dotted green line) and predicted (dotted blue line) 3s-trajectories
  • Figure 2: Feed-forward network architecture combining raster image and actor state inputs
  • Figure 3: LSTM decoder
  • Figure 4: Reliability diagrams at horizons of: (a) 1s; (b) 3s
  • Figure 5: Analysis of the MNv2 model on the three case studies, with results overlaid over the input raster images; the first column shows ground-truth (dotted green line) and predicted (dotted blue line) 3-second trajectories, the second column shows aleatoric uncertainty output by the model, the third column shows epistemic uncertainty estimated by dropout analysis, the fourth column shows relevant parts of raster estimated by occlusion sensitivity analysis; state inputs are provided above the rasters in the first column, indicating velocity (v) in $m/s$, acceleration (a) in $m/s^2$, heading change rate (hcr) in $deg/s$
  • ...and 2 more figures