High-Speed Vision-Based Flight in Clutter with Safety-Shielded Reinforcement Learning
Jiarui Zhang, Chengyong Lei, Chengjiang Dai, Lijie Wang, Zhichao Han, Fei Gao
TL;DR
This work addresses the challenge of fast, safe navigation for quadrotors in cluttered environments by marrying end to end reinforcement learning with model based safety. It introduces a geodesic based reward shaping during training and a real time HOCBF based safety filter during deployment, enabling high speed flight with formal collision avoidance. Key contributions include a Dijkstra derived navigation potential, ESDF based safety shaping, a HOCBF corrected action layer, and robust sim to real transfer validated across simulation and real world flights up to $7.5$ m/s. The approach outperforms traditional planners and learning baselines while maintaining strong sim to real transfer, highlighting a practical path to agile yet safe autonomous flight in real environments.
Abstract
Quadrotor unmanned aerial vehicles (UAVs) are increasingly deployed in complex missions that demand reliable autonomous navigation and robust obstacle avoidance. However, traditional modular pipelines often incur cumulative latency, whereas purely reinforcement learning (RL) approaches typically provide limited formal safety guarantees. To bridge this gap, we propose an end-to-end RL framework augmented with model-based safety mechanisms. We incorporate physical priors in both training and deployment. During training, we design a physics-informed reward structure that provides global navigational guidance. During deployment, we integrate a real-time safety filter that projects the policy outputs onto a provably safe set to enforce strict collision-avoidance constraints. This hybrid architecture reconciles high-speed flight with robust safety assurances. Benchmark evaluations demonstrate that our method outperforms both traditional planners and recent end-to-end obstacle avoidance approaches based on differentiable physics. Extensive experiments demonstrate strong generalization, enabling reliable high-speed navigation in dense clutter and challenging outdoor forest environments at velocities up to 7.5m/s.
