ProxFly: Robust Control for Close Proximity Quadcopter Flight via Residual Reinforcement Learning
Ruiqi Zhang, Dingqi Zhang, Mark W. Mueller
TL;DR
ProxFly addresses the challenge of robust close-proximity quadcopter flight under downwash disturbances by stacking a residual reinforcement learning module on top of a traditional cascaded controller. The residual policy learns to compensate for external disturbances without requiring inter-vehicle communication, using domain randomization to generalize across model variations and guiding the RL with the basic controller to improve data efficiency. The approach demonstrates significant gains in position and attitude accuracy in both simulation and real-world proximity tasks, achieving performance comparable to a complex model-based aerodynamic compensator and enabling reliable aerial docking under extreme proximity. This hybrid control framework reduces reliance on precise aerodynamics modeling, lowers data and communication requirements, and provides a practical path toward robust, multi-vehicle quadcopter operations in constrained environments.
Abstract
This paper proposes the ProxFly, a residual deep Reinforcement Learning (RL)-based controller for close proximity quadcopter flight. Specifically, we design a residual module on top of a cascaded controller (denoted as basic controller) to generate high-level control commands, which compensate for external disturbances and thrust loss caused by downwash effects from other quadcopters. First, our method takes only the ego state and controllers' commands as inputs and does not rely on any communication between quadcopters, thereby reducing the bandwidth requirement. Through domain randomization, our method relaxes the requirement for accurate system identification and fine-tuned controller parameters, allowing it to adapt to changing system models. Meanwhile, our method not only reduces the proportion of unexplainable signals from the black box in control commands but also enables the RL training to skip the time-consuming exploration from scratch via guidance from the basic controller. We validate the effectiveness of the residual module in the simulation with different proximities. Moreover, we conduct the real close proximity flight test to compare ProxFly with the basic controller and an advanced model-based controller with complex aerodynamic compensation. Finally, we show that ProxFly can be used for challenging quadcopter mid-air docking, where two quadcopters fly in extreme proximity, and strong airflow significantly disrupts flight. However, our method can stabilize the quadcopter in this case and accomplish docking. The resources are available at https://github.com/ruiqizhang99/ProxFly.
