RL-based Control of UAS Subject to Significant Disturbance
Kousheek Chakraborty, Thijs Hof, Ayham Alharbat, Abeje Mersha
TL;DR
This work tackles proactive disturbance rejection for quadrotor UAS subjected to predictable impulse recoil, such as from a water cannon. It introduces a reinforcement learning framework that augments the policy's observations with a warning trigger signal, enabling preemptive stabilization before the disturbance occurs. Through PPO-based training in a high-fidelity simulator with domain randomization, three policies are compared: a nominal policy, an impulse-aware policy without the trigger, and a predictive policy that utilizes the trigger. The predictive policy demonstrates superior performance by initiating preemptive corrections, reducing position error and control effort relative to reactive or baselined approaches. The results highlight the potential of predictive cues to enhance RL-based control in disturbance-prone aerial systems and suggest directions for real-world validation and generalization to other platforms and disturbance types.
Abstract
This paper proposes a Reinforcement Learning (RL)-based control framework for position and attitude control of an Unmanned Aerial System (UAS) subjected to significant disturbance that can be associated with an uncertain trigger signal. The proposed method learns the relationship between the trigger signal and disturbance force, enabling the system to anticipate and counteract the impending disturbances before they occur. We train and evaluate three policies: a baseline policy trained without exposure to the disturbance, a reactive policy trained with the disturbance but without the trigger signal, and a predictive policy that incorporates the trigger signal as an observation and is exposed to the disturbance during training. Our simulation results show that the predictive policy outperforms the other policies by minimizing position deviations through a proactive correction maneuver. This work highlights the potential of integrating predictive cues into RL frameworks to improve UAS performance.
