Towards Task-Oriented Flying: Framework, Infrastructure, and Principles
Kangyao Huang, Hao Wang, Jingyu Chen, Jintao Chen, Yu Luo, Di Guo, Xiangkui Zhang, Xiangyang Ji, Huaping Liu
TL;DR
The paper presents a task-oriented end-to-end DRL framework for quadrotors, combined with an open-source, full-stack infrastructure (AirGym, AirGym-Real, rlPx4Controller) to enable rapid training and zero-shot sim-to-real deployment. It codifies design principles for task specification, perception, and training, and validates with four tasks—tracking, obstacle avoidance, high-speed maneuvers, and forest navigation—demonstrating robust performance under real-world disturbances. Key contributions include a principled framework linking simulation and hardware deployment, a scalable workflow for training and transfer, and detailed strategies for system identification, hover throttle learning, action continuity, trajectory guidance, and domain randomization. The work lowers entry barriers for practitioners to deploy learning-based controllers on aerial robots and provides a practical foundation for autonomously navigating dynamic, unstructured environments.
Abstract
Deploying robot learning methods to aerial robots in unstructured environments remains both challenging and promising. While recent advances in deep reinforcement learning (DRL) have enabled end-to-end flight control, the field still lacks systematic design guidelines and a unified infrastructure to support reproducible training and real-world deployment. We present a task-oriented framework for end-to-end DRL in quadrotors that integrates design principles for complex task specification and reveals the interdependencies among simulated task definition, training design principles, and physical deployment. Our framework involves software infrastructure, hardware platforms, and open-source firmware to support a full-stack learning infrastructure and workflow. Extensive empirical results demonstrate robust flight and sim-to-real generalization under real-world disturbances. By reducing the entry barrier for deploying learning-based controllers on aerial robots, our work lays a practical foundation for advancing autonomous flight in dynamic and unstructured environments.
