Modeling Pedestrian Crossing Behavior: A Reinforcement Learning Approach with Sensory Motor Constraints
Yueyang Wang, Aravinda Ramakrishnan Srinivasan, Yee Mun Lee, Gustav Markkula
TL;DR
This work tackles safe autonomous-vehicle interactions with pedestrians by proposing a constrained reinforcement learning framework that embeds sensory-motor realism, including noisy perception, looming aversion, time pressure, walking effort, and ballistic speed control, plus a biomechanical walking model. It uses a human-in-the-loop dataset from a controlled crossing experiment and optimizes non-policy parameters via Bayesian optimization, while policy learning is conducted with Proximal Policy Optimization in a PBOMDP setting. The results show that the fully sensory-motor (SM) variant reproduces a broad set of human-like behaviors, notably gap acceptance, CIT, day/night differences, and realistic walking speed profiles, underscoring the importance of integrating sensory-motor constraints for realism and generalization. The findings have practical implications for improving AV decision-making, virtual testing, and traffic-safety modeling, by providing a more accurate and interpretable framework for pedestrian-vehicle interactions.
Abstract
Understanding pedestrian behavior is crucial for the safe deployment of Autonomous Vehicles (AVs) in urban environments. Traditional pedestrian behavior models often fall into two categories: mechanistic models, which do not generalize well to complex environments, and machine-learned models, which generally overlook sensory-motor constraints influencing human behavior and thus prone to fail in untrained scenarios. We hypothesize that sensory-motor constraints, fundamental to how humans perceive and interact with their surroundings, are essential for realistic simulations. Thus, we introduce a constrained reinforcement learning (RL) model that simulates the crossing decision and locomotion of pedestrians. It was constrained to emulate human sensory mechanisms with noisy visual perception and looming aversion. Additionally, human motor constraint was incorporated through a bio-mechanical model of walking. We gathered data from a human-in-the-loop experiment to understand pedestrian behavior. The findings reveal several phenomena not addressed by existing pedestrian models, regarding how pedestrians adapt their walking speed to the kinematics and behavior of the approaching vehicle. Our model successfully captures these human-like walking speed patterns, enabling us to understand these patterns as a trade-off between time pressure and walking effort. Importantly, the model retains the ability to reproduce various phenomena previously captured by a simpler version of the model. Additionally, phenomena related to external human-machine interfaces and light conditions were also included. Overall, our results not only demonstrate the potential of constrained RL in modeling pedestrian behaviors but also highlight the importance of sensory-motor mechanisms in modeling pedestrian-vehicle interactions.
