Generating Synthetic Ground Truth Distributions for Multi-step Trajectory Prediction using Probabilistic Composite Bézier Curves
Ronny Hug, Stefan Becker, Wolfgang Hübner, Michael Arens
TL;DR
The paper tackles the lack of ground-truth distribution data for multi-step trajectory prediction by introducing composite probabilistic Bézier curves, or N-Curves, which generate full-trajectory distributions and permit posterior conditioning via Gaussian-process equivalence. It defines a multi-path dataset as a mixture of N-Curves, derives the prior as a Gaussian mixture with a structured mean and covariance, and computes posteriors by conditioning on observed trajectory segments. An exemplary evaluation demonstrates training a multi-modal predictor (RED) on synthetic data and assessing performance with the Wasserstein distance in addition to NLL, arguing for the greater interpretability and variance-awareness of distributional metrics. The work enables more expressive benchmarking and posterior analysis for probabilistic trajectory predictors, at the cost of additional computational complexity in evaluating distributional distances.
Abstract
An appropriate data basis is one of the most important aspects of training and evaluating probabilistic trajectory prediction models based on neural networks. In this regard, a common shortcoming of current benchmark datasets is that they are limited to sets of sample trajectories and lack actual ground truth distributions, which prevents the use of more expressive error metrics, such as the Wasserstein distance, for model evaluation. To this end, this paper proposes a novel approach to synthetic dataset generation based on composite probabilistic Bézier curves, which can generate ground truth data in the form of probability distributions over full trajectories. This allows the calculation of arbitrary posterior distributions. The paper showcases an exemplary trajectory prediction model evaluation using the generated ground truth distribution data.
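To make the two ingredients named above concrete, the following is a minimal sketch, not the authors' implementation. It assumes a single Gaussian over a short one-dimensional trajectory (the paper's prior is a mixture, which would also require reweighting the components), conditions it on an observed segment using the standard Gaussian conditioning formulas, and compares two Gaussians with the closed-form 2-Wasserstein distance. The function names, the squared-exponential toy covariance, and the observed values are all illustrative assumptions.

```python
import numpy as np


def condition_gaussian(mean, cov, obs_idx, obs_values):
    """Condition a joint Gaussian on exact observations of some coordinates.

    Returns the mean and covariance of the remaining (unobserved) coordinates.
    """
    obs_idx = np.asarray(obs_idx)
    rest_idx = np.setdiff1d(np.arange(mean.shape[0]), obs_idx)
    s_oo = cov[np.ix_(obs_idx, obs_idx)]
    s_ro = cov[np.ix_(rest_idx, obs_idx)]
    s_rr = cov[np.ix_(rest_idx, rest_idx)]
    # gain = s_ro @ inv(s_oo), computed via a linear solve (s_oo is symmetric).
    gain = np.linalg.solve(s_oo, s_ro.T).T
    post_mean = mean[rest_idx] + gain @ (np.asarray(obs_values) - mean[obs_idx])
    post_cov = s_rr - gain @ s_ro.T
    return post_mean, post_cov


def _sqrtm_psd(a):
    """Square root of a symmetric positive semi-definite matrix."""
    a = 0.5 * (a + a.T)  # remove numerical asymmetry
    w, v = np.linalg.eigh(a)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.T


def wasserstein2_gaussian(m1, c1, m2, c2):
    """Closed-form 2-Wasserstein distance between two Gaussians."""
    s2 = _sqrtm_psd(c2)
    cross = _sqrtm_psd(s2 @ c1 @ s2)
    w2_sq = np.sum((m1 - m2) ** 2) + np.trace(c1 + c2 - 2.0 * cross)
    return float(np.sqrt(max(w2_sq, 0.0)))


# Toy prior over 4 time steps (assumed values, for illustration only).
times = np.arange(4.0)
prior_mean = 0.5 * times
prior_cov = np.exp(-0.5 * (times[:, None] - times[None, :]) ** 2)
prior_cov += 1e-6 * np.eye(4)  # jitter for numerical stability

# Observe the first two steps, then predict the remaining two.
post_mean, post_cov = condition_gaussian(prior_mean, prior_cov, [0, 1], [0.1, 0.7])
print("posterior mean:", post_mean)
print("W2(prior tail, posterior):",
      wasserstein2_gaussian(prior_mean[2:], prior_cov[2:, 2:], post_mean, post_cov))
```

Unlike a likelihood evaluated on sample trajectories, the distance in the last line compares both the means and the covariances of the two distributions, which is only possible when the ground truth is itself available as a distribution.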
