Learning Environment for the Air Domain (LEAD)
Andreas Strand, Patrick Gorton, Martin Asprusten, Karsten Brathen
TL;DR
LEAD introduces a modular, Gymnasium-based learning environment for air-domain CGFs that interworks with third-party simulators through distributed simulation. It combines a fast, replaceable Simulation System (SACS) with an Agent Gateway, Simulation Interpreter, and a Distributed Simulation Service to enable RL and IL for realistic air-combat behavior. The authors demonstrate PPO-based formation-flight learning in LEAD, achieving robust wingman formation under varying lead maneuvers and random initial conditions, with insights into training dynamics and scalability. This work advances practical, scalable workflows for training adaptive CGFs across simulations of different fidelities, potentially improving pilot training with intelligent autonomous adversaries.
Abstract
A substantial part of fighter pilot training is simulation-based and involves computer-generated forces controlled by predefined behavior models. The behavior models are typically manually created by eliciting knowledge from experienced pilots, which is a time-consuming process. Despite the work put in, the behavior models are often unsatisfactory due to their predictable nature and lack of adaptivity, forcing instructors to spend time manually monitoring and controlling them. Reinforcement and imitation learning pose as alternatives to handcrafted models. This paper presents the Learning Environment for the Air Domain (LEAD), a system for creating and integrating intelligent air combat behavior in military simulations. By incorporating the popular programming library and interface Gymnasium, LEAD allows users to apply readily available machine learning algorithms. Additionally, LEAD can communicate with third-party simulation software through distributed simulation protocols, which allows behavior models to be learned and employed using simulation systems of different fidelities.
