Realistic pedestrian-driver interaction modelling using multi-agent RL with human perceptual-motor constraints
Yueyang Wang, Mehmet Dogar, Gustav Markkula
TL;DR
This paper tackles realistic pedestrian–driver interactions at unsignalised crossings by introducing a two-agent MARL framework that embeds human perceptual and motor constraints, including gaze-dependent acuity and ballistic motor control. It introduces gaze-aware visual processing, Bayesian perception, and motion-cost penalties, and models inter-individual variability through population-level distributions of non-policy parameters, evaluated on real-world one-to-one crossing data. Among four variants (NC, MC, VC, VMC), the Visual-and-Motor-Constraint (VMC) model achieves the best fit to real trajectories, evidenced by the lowest composite $NLL$ and closest trajectory reproductions, outperforming a behavioural cloning baseline in a data-limited setting. The work demonstrates that integrating both perceptual uncertainty and motor execution constraints improves realism in interactive road-user modelling, with implications for safer autonomous-vehicle planning and evaluation.
Abstract
Modelling pedestrian-driver interactions is critical for understanding human road user behaviour and developing safe autonomous vehicle systems. Existing approaches often rely on rule-based logic, game-theoretic models, or 'black-box' machine learning methods. However, these models typically lack flexibility or overlook the underlying mechanisms, such as sensory and motor constraints, which shape how pedestrians and drivers perceive and act in interactive scenarios. In this study, we propose a multi-agent reinforcement learning (RL) framework that integrates both visual and motor constraints of pedestrian and driver agents. Using a real-world dataset from an unsignalised pedestrian crossing, we evaluate four model variants, one without constraints, two with either motor or visual constraints, and one with both, across behavioural metrics of interaction realism. Results show that the combined model with both visual and motor constraints performs best. Motor constraints lead to smoother movements that resemble human speed adjustments during crossing interactions. The addition of visual constraints introduces perceptual uncertainty and field-of-view limitations, leading the agents to exhibit more cautious and variable behaviour, such as less abrupt deceleration. In this data-limited setting, our model outperforms a supervised behavioural cloning model, demonstrating that our approach can be effective without large training datasets. Finally, our framework accounts for individual differences by modelling parameters controlling the human constraints as population-level distributions, a perspective that has not been explored in previous work on pedestrian-vehicle interaction modelling. Overall, our work demonstrates that multi-agent RL with human constraints is a promising modelling approach for simulating realistic road user interactions.
