RaCIL: Ray Tracing based Multi-UAV Obstacle Avoidance through Composite Imitation Learning
Harsh Bansal, Vyom Goyal, Bhaskar Joshi, Akhil Gupta, Harikumar Kandath
TL;DR
This work tackles UAV obstacle avoidance in dynamic, multi-UAV environments by proposing RaCIL, a composite imitation learning framework that fuses PPO with Behavioral Cloning and Generative Adversarial Imitation Learning, enhanced by ray-tracing based observations. The approach demonstrates that incorporating ray tracing improves obstacle detection and accelerates training, while GAIL promotes coordinated flight among multiple UAVs, yielding higher success rates than BC alone. Evaluation in Unity ML-Agents shows scalable performance from 1 to 3 UAVs, with notable gains in reward and safety metrics, suggesting practical potential for autonomous UAV operations in crowded or dynamic scenarios. The findings highlight the value of combining ray-tracing perception with imitation learning to achieve robust, scalable UAV navigation, with future work targeting 3D extension and real-world deployment.
Abstract
In this study, we address the challenge of obstacle avoidance for Unmanned Aerial Vehicles (UAVs) through an innovative composite imitation learning approach that combines Proximal Policy Optimization (PPO) with Behavior Cloning (BC) and Generative Adversarial Imitation Learning (GAIL), enriched by the integration of ray-tracing techniques. Our research underscores the significant role of ray-tracing in enhancing obstacle detection and avoidance capabilities. Moreover, we demonstrate the effectiveness of incorporating GAIL in coordinating the flight paths of two UAVs, showcasing improved collision avoidance capabilities. Extending our methodology, we apply our combined PPO, BC, GAIL, and ray-tracing framework to scenarios involving four UAVs, illustrating its scalability and adaptability to more complex scenarios. The findings indicate that our approach not only improves the reliability of basic PPO based obstacle avoidance but also paves the way for advanced autonomous UAV operations in crowded or dynamic environments.
