Flying in Highly Dynamic Environments with End-to-end Learning Approach
Xiyu Fan, Minghao Lu, Bowen Xu, Peng Lu
TL;DR
The paper tackles autonomous obstacle avoidance for quadrotors in highly dynamic cluttered environments, where static planners struggle. It introduces an end-to-end framework that encodes lidar point clouds into a fixed-size 2D obstacle map and trains a neural policy to output horizontal acceleration commands using reinforcement learning. Key contributions include a novel lidar data encoding scheme, end-to-end training that handles both static and dynamic obstacles, and extensive simulations and real-world demonstrations showing reduced latency and robust high-speed obstacle avoidance. The approach offers portable on-board perception-to-action capabilities suitable for real-time operation in cluttered environments, with future work extending to full 3D maneuvers and stability improvements.
Abstract
Obstacle avoidance for unmanned aerial vehicles like quadrotors is a popular research topic. Most existing research focuses only on static environments, and obstacle avoidance in environments with multiple dynamic obstacles remains challenging. This paper proposes a novel deep-reinforcement learning-based approach for the quadrotors to navigate through highly dynamic environments. We propose a lidar data encoder to extract obstacle information from the massive point cloud data from the lidar. Multi frames of historical scans will be compressed into a 2-dimension obstacle map while maintaining the obstacle features required. An end-to-end deep neural network is trained to extract the kinematics of dynamic and static obstacles from the obstacle map, and it will generate acceleration commands to the quadrotor to control it to avoid these obstacles. Our approach contains perception and navigating functions in a single neural network, which can change from a navigating state into a hovering state without mode switching. We also present simulations and real-world experiments to show the effectiveness of our approach while navigating in highly dynamic cluttered environments.
