Reinforcement Learning for Gliding Projectile Guidance and Control
Joel Cahn, Antonin Thomas, Philippe Pastor
TL;DR
The paper develops a six-degree-of-freedom reinforcement-learning framework to control a fixed-wing gliding projectile toward a camera-detected target. It builds a parametric glider model, a strapdown camera projection, and a custom Gymnasium environment to train a PPO agent that optimizes LOS-based guidance while penalizing actuator effort. Results show rapid convergence and improved performance over a classic PID baseline in windy conditions, highlighting RL's potential for autonomous glider navigation. Limitations identified include LOS-guidance weaknesses with moving targets and wind, with future work proposing more robust guidance and real-world validation.
Abstract
This paper presents the development of a control law, which is intended to be implemented on an optical guided glider. This guiding law follows an innovative approach, the reinforcement learning. This control law is used to make navigation more flexible and autonomous in a dynamic environment. The final objective is to track a target detected with the camera and then guide the glider to this point with high precision. Already applied on quad-copter drones, we wish by this study to demonstrate the applicability of reinforcement learning for fixed-wing aircraft on all of its axis.
