Social Coordination and Altruism in Autonomous Driving
Behrad Toghi, Rodolfo Valiente, Dorsa Sadigh, Ramtin Pedarsani, Yaser P. Fallah
TL;DR
The paper tackles safety and efficiency in mixed-autonomy traffic by modeling AV-HV decision-making as a partially observable stochastic game and training altruistic autonomous agents via deep multi-agent reinforcement learning. It introduces a decentralized reward framework that splits altruism into sympathy (toward HVs) and cooperation (among AVs) and optimizes a social value orientation angle to balance self and group utilities. Through a highway merging case study, the authors show that altruistic AVs can form alliances, influence HV behavior, and substantially improve merging success, traffic flow, and safety, while a purely egoistic approach underperforms. The work also proposes a semi-sequential training paradigm to mitigate non-stationarity and demonstrates robustness to different human-driver models, highlighting the practical potential and limitations of deploying altruistic coordination in real-world traffic.
Abstract
Despite the advances in the autonomous driving domain, autonomous vehicles (AVs) are still inefficient and limited in terms of cooperating with each other or coordinating with vehicles operated by humans. A group of autonomous and human-driven vehicles (HVs) which work together to optimize an altruistic social utility -- as opposed to the egoistic individual utility -- can co-exist seamlessly and assure safety and efficiency on the road. Achieving this mission without explicit coordination among agents is challenging, mainly due to the difficulty of predicting the behavior of humans with heterogeneous preferences in mixed-autonomy environments. Formally, we model an AV's maneuver planning in mixed-autonomy traffic as a partially-observable stochastic game and attempt to derive optimal policies that lead to socially-desirable outcomes using a multi-agent reinforcement learning framework. We introduce a quantitative representation of the AVs' social preferences and design a distributed reward structure that induces altruism into their decision making process. Our altruistic AVs are able to form alliances, guide the traffic, and affect the behavior of the HVs to handle competitive driving scenarios. As a case study, we compare egoistic AVs to our altruistic autonomous agents in a highway merging setting and demonstrate the emerging behaviors that lead to a noticeable improvement in the number of successful merges as well as the overall traffic flow and safety.
