ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios
Dingrui Wang, Zheyuan Lai, Yuda Li, Yi Wu, Yuexin Ma, Johannes Betz, Ruigang Yang, Wei Li
TL;DR
The paper tackles long-term prediction in emergency autonomous driving by introducing the ESP problem and the ESP-Dataset, which encodes rich extrospective semantic cues from the environment. It proposes a lightweight ESP encoder that can be plugged into existing predictors and a new time-aware evaluation metric, CT E, to properly assess sub-second event timing. Experimental results show ESP features consistently boost state-of-the-art backbones like TNT and MTR, with ablations highlighting the contribution of individual ESP components. The work also demonstrates the potential of integrating ESP with large language models to reason about extrospective cues, offering a path toward safer, more anticipatory autonomous driving in rare but critical scenarios.
Abstract
Emergent-scene safety is the key milestone for fully autonomous driving, and reliable on-time prediction is essential to maintain safety in emergency scenarios. However, these emergency scenarios are long-tailed and hard to collect, which restricts the system from getting reliable predictions. In this paper, we build a new dataset, which aims at the long-term prediction with the inconspicuous state variation in history for the emergency event, named the Extro-Spective Prediction (ESP) problem. Based on the proposed dataset, a flexible feature encoder for ESP is introduced to various prediction methods as a seamless plug-in, and its consistent performance improvement underscores its efficacy. Furthermore, a new metric named clamped temporal error (CTE) is proposed to give a more comprehensive evaluation of prediction performance, especially in time-sensitive emergency events of subseconds. Interestingly, as our ESP features can be described in human-readable language naturally, the application of integrating into ChatGPT also shows huge potential. The ESP-dataset and all benchmarks are released at https://dingrui-wang.github.io/ESP-Dataset/.
