Flow-guided Motion Prediction with Semantics and Dynamic Occupancy Grid Maps
Rabbia Asghar, Wenqian Liu, Lukas Rummelhard, Anne Spalanzani, Christian Laugier
TL;DR
This work tackles multi-step driving-scene prediction by fusing probabilistic Dynamic Occupancy Grid Maps (DOGMs) with semantic information to predict both future semantic grids and scene flow. It introduces a flow-guided, multi-task framework that outputs a sequence of semantic grids and per-cell flow maps, then warps the current semantic grid with the predicted flows to obtain warped future grids. The model is a conditional variational autoencoder with ConvLSTM/ConvGRU components and a dual decoder that produces present and future predictions. It is trained with BCE losses for semantics, an L1 loss for flow, and a KL regularizer. Evaluated on nuScenes, the approach improves prediction accuracy and retains dynamic vehicles better, which shows the practical value of scene flow for autonomous driving planning in allo-centric coordinates.
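The warping step can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `warp_grid`, the backward-flow convention (each target cell stores the offset to the source cell it samples from), nearest-neighbour sampling, and zero-filling of cells whose source lies outside the grid are all assumptions made for illustration.

```python
import numpy as np


def warp_grid(grid, flow):
    """Backward-warp a 2D semantic grid with a per-cell flow field.

    grid: (H, W) array, e.g. vehicle probabilities at the current time step.
    flow: (H, W, 2) array; flow[r, c] = (d_row, d_col) is the offset, in
          cells, from target cell (r, c) to the source cell it is sampled
          from. This convention is an assumption, not taken from the paper.
    """
    h, w = grid.shape
    rows, cols = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")

    # Source coordinates for every target cell, rounded to the nearest cell.
    src_r = np.rint(rows + flow[..., 0]).astype(int)
    src_c = np.rint(cols + flow[..., 1]).astype(int)

    # Cells whose source falls outside the grid stay at zero.
    valid = (src_r >= 0) & (src_r < h) & (src_c >= 0) & (src_c < w)
    warped = np.zeros_like(grid)
    warped[valid] = grid[src_r[valid], src_c[valid]]
    return warped


# Usage: one occupied cell, with every target cell sampling from its left
# neighbour, so the occupied cell shifts one column to the right.
grid = np.zeros((4, 4))
grid[1, 1] = 1.0
flow = np.zeros((4, 4, 2))
flow[..., 1] = -1.0
warped = warp_grid(grid, flow)
```

A learned model would more likely use differentiable bilinear sampling so that gradients can pass through the warp; nearest-neighbour lookup is used here only to keep the sketch short.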
Abstract
Accurate prediction of driving scenes is essential for road safety and autonomous driving. Occupancy Grid Maps (OGMs) are commonly employed for scene prediction due to their structured spatial representation, flexibility across sensor modalities, and integration of uncertainty. Recent studies have successfully combined OGMs with deep learning methods to predict the evolution of the scene and learn complex behaviours. These methods, however, do not consider the prediction of flow or velocity vectors in the scene. In this work, we propose a novel multi-task framework that leverages dynamic OGMs and semantic information to predict both future vehicle semantic grids and the future flow of the scene. Incorporating semantic flow not only provides intermediate scene features but also enables the generation of warped semantic grids. Evaluation on the real-world nuScenes dataset demonstrates improved prediction capabilities and an enhanced ability of the model to retain dynamic vehicles within the scene.
