An End-to-End Collaborative Learning Approach for Connected Autonomous Vehicles in Occluded Scenarios
Leandro Parada, Hanlin Tian, Jose Escribano, Panagiotis Angeloudis
TL;DR
This work tackles safe navigation in occluded intersections where single-agent policies struggle. It introduces an end-to-end collaborative learning framework that uses V2V sharing of compressed LiDAR features and Multi-agent Proximal Policy Optimisation to train decentralized policies. The approach demonstrates superior safety and efficiency compared to independent RL and cooperative early fusion baselines while respecting practical bandwidth constraints, leveraging a CARLA-based occluded intersection environment. The findings highlight the benefits of collaborative perception for CAVs in occluded scenarios and underscore the importance of robust performance under realistic communication limits and sensor dropout. Overall, the paper advances practical, scalable MARL solutions for connected autonomous driving in challenging urban settings.
Abstract
Collaborative navigation becomes essential in situations of occluded scenarios in autonomous driving where independent driving policies are likely to lead to collisions. One promising approach to address this issue is through the use of Vehicle-to-Vehicle (V2V) networks that allow for the sharing of perception information with nearby agents, preventing catastrophic accidents. In this article, we propose a collaborative control method based on a V2V network for sharing compressed LiDAR features and employing Proximal Policy Optimisation to train safe and efficient navigation policies. Unlike previous approaches that rely on expert data (behaviour cloning), our proposed approach learns the multi-agent policies directly from experience in the occluded environment, while effectively meeting bandwidth limitations. The proposed method first prepossesses LiDAR point cloud data to obtain meaningful features through a convolutional neural network and then shares them with nearby CAVs to alert for potentially dangerous situations. To evaluate the proposed method, we developed an occluded intersection gym environment based on the CARLA autonomous driving simulator, allowing real-time data sharing among agents. Our experimental results demonstrate the consistent superiority of our collaborative control method over an independent reinforcement learning method and a cooperative early fusion method.
