Does Unpredictability Influence Driving Behavior?
Sepehr Samavi, Florian Shkurti, Angela P. Schoellig
TL;DR
The paper addresses how surrounding driver unpredictability influences ego-vehicle lane-change behavior. It introduces an unpredictability metric derived from the error of a trajectory predictor and embeds it as an additional feature in a Maximum Entropy Inverse Reinforcement Learning framework to learn two lane-change reward functions (baseline and unpredictability-aware). Evaluations on datasets including US-101, I-80, and highD show the unpredictability-aware rewards produce trajectories that better fit human data, with an average MEE improvement of about 5.9% on test sets and qualitatively more cautious maneuvers when adjacent cars are unpredictable. The work suggests unpredictability is a valuable signal for human-aligned planning and paves the way for exploring alternate predictors and nonlinear reward structures in driving policy design.
Abstract
In this paper we investigate the effect of the unpredictability of surrounding cars on an ego-car performing a driving maneuver. We use Maximum Entropy Inverse Reinforcement Learning to model reward functions for an ego-car conducting a lane change in a highway setting. We define a new feature based on the unpredictability of surrounding cars and use it in the reward function. We learn two reward functions from human data: a baseline and one that incorporates our defined unpredictability feature, then compare their performance with a quantitative and qualitative evaluation. Our evaluation demonstrates that incorporating the unpredictability feature leads to a better fit of human-generated test data. These results encourage further investigation of the effect of unpredictability on driving behavior.
