Hacking Predictors Means Hacking Cars: Using Sensitivity Analysis to Identify Trajectory Prediction Vulnerabilities for Autonomous Driving Security
Marsalis Gibson, David Babazadeh, Claire Tomlin, Shankar Sastry
TL;DR
This paper investigates how perturbations to inputs of multi-modal trajectory predictors affect downstream autonomous-driving planning. Using sensitivity analysis on Trajectron++ and AgentFormer, it shows that predictions are most sensitive to the most recent state histories, but small image perturbations via FGSM can also cause large prediction errors, leading to abrupt stops in planning. The study defines a perturbation-attribution framework using $ADE$ as the performance metric and demonstrates through planning experiments that adversarial inputs can transfer to control decisions. These findings highlight practical security risks in deploying neural trajectory predictors and motivate robustness defenses to limit the attack surface in cyber-physical driving systems, with $p(Y|X)=\int p_{\phi}(Y|X,Z) p_{\psi}(Z|X)\,dZ$ guiding the model characterization and sensitivity attribution.
Abstract
Adversarial attacks on learning-based multi-modal trajectory predictors have already been demonstrated. However, there are still open questions about the effects of perturbations on inputs other than state histories, and how these attacks impact downstream planning and control. In this paper, we conduct a sensitivity analysis on two trajectory prediction models, Trajectron++ and AgentFormer. The analysis reveals that between all inputs, almost all of the perturbation sensitivities for both models lie only within the most recent position and velocity states. We additionally demonstrate that, despite dominant sensitivity on state history perturbations, an undetectable image map perturbation made with the Fast Gradient Sign Method can induce large prediction error increases in both models, revealing that these trajectory predictors are, in fact, susceptible to image-based attacks. Using an optimization-based planner and example perturbations crafted from sensitivity results, we show how these attacks can cause a vehicle to come to a sudden stop from moderate driving speeds.
