R3ST: A Synthetic 3D Dataset With Realistic Trajectories
Simone Teglia, Claudia Melis Tonti, Francesco Pro, Leonardo Russo, Andrea Alfarano, Leonardo Pentassuglia, Irene Amerini
TL;DR
R3ST tackles the gap between synthetic data convenience and realism in vehicle motion by embedding real trajectories from SinD into Blender-rendered urban intersections, producing photorealistic imagery with rich multimodal annotations. The dataset spans two intersections with four camera views and over 80K frames, enabling tasks from object detection to monocular depth estimation and trajectory forecasting. By evaluating pre-trained models and showing strong detection performance after fine-tuning, the work demonstrates R3ST's utility for training and evaluating autonomous driving and traffic-analysis systems. This approach promises better generalization to real-world driving, addressing domain shift while preserving the benefits of synthetic data generation.
Abstract
Datasets are essential to train and evaluate computer vision models used for traffic analysis and to enhance road safety. Existing real datasets fit real-world scenarios, capturing authentic road object behaviors, however, they typically lack precise ground-truth annotations. In contrast, synthetic datasets play a crucial role, allowing for the annotation of a large number of frames without additional costs or extra time. However, a general drawback of synthetic datasets is the lack of realistic vehicle motion, since trajectories are generated using AI models or rule-based systems. In this work, we introduce R3ST (Realistic 3D Synthetic Trajectories), a synthetic dataset that overcomes this limitation by generating a synthetic 3D environment and integrating real-world trajectories derived from SinD, a bird's-eye-view dataset recorded from drone footage. The proposed dataset closes the gap between synthetic data and realistic trajectories, advancing the research in trajectory forecasting of road vehicles, offering both accurate multimodal ground-truth annotations and authentic human-driven vehicle trajectories.
