Learning Speed Adaptation for Flight in Clutter
Guangyu Zhao, Tianyue Wu, Yeke Chen, Fei Gao
TL;DR
The paper tackles safe, efficient flight in unknown clutter by introducing a hierarchical framework where an outer-loop RL policy learns a speed constraint $v^{\dagger}$ to guide a robust model-based planner. This two-tier approach yields perception-aware, aggressive yet safe trajectories, outperforming constant-speed baselines and an EVA-planner in both simulation and real-world tests. The key novelty lies in the two-stage reward design that stabilizes learning amid stochastic termination and perception latency, coupled with a local occupancy-map representation that enables perception-aware behaviors. The results indicate significant practical impact for fast, reliable autonomous flight in cluttered environments and highlight avenues for giving the learned policy more control while preserving the safety guarantees of traditional planners.
Abstract
Animals learn to adapt speed of their movements to their capabilities and the environment they observe. Mobile robots should also demonstrate this ability to trade-off aggressiveness and safety for efficiently accomplishing tasks. The aim of this work is to endow flight vehicles with the ability of speed adaptation in prior unknown and partially observable cluttered environments. We propose a hierarchical learning and planning framework where we utilize both well-established methods of model-based trajectory generation and trial-and-error that comprehensively learns a policy to dynamically configure the speed constraint. Technically, we use online reinforcement learning to obtain the deployable policy. The statistical results in simulation demonstrate the advantages of our method over the constant speed constraint baselines and an alternative method in terms of flight efficiency and safety. In particular, the policy behaves perception awareness, which distinguish it from alternative approaches. By deploying the policy to hardware, we verify that these advantages can be brought to the real world.
