Nonlinear Optimal Guidance for Impact Time Control with Field-of-View Constraint
Fangmin Lu, Zheng Chen, Kun Wang
TL;DR
This work tackles impact time control with Field-of-View constraints by converting the inequality-constrained nonlinear OCP into an equality-constrained problem using a saturation function, then deriving necessary optimality conditions via Pontryagin's Maximum Principle. A parameterized extremal system is constructed so that varying a few trajectory parameters generates rich, optimal-like data, which is used to train a neural network that maps the current polar state $\left(r,\sigma\right)$ and time-to-go $t_g$ to an optimal guidance command $u$ in real time. The neural network, with a compact two-hidden-layer architecture, delivers guidance commands in about $1\times 10^{-2}$ seconds and respects the FOV bound while achieving nearly optimal control effort, as demonstrated across multiple simulations and comparisons with existing PN- and SMC-based ITCG methods. This approach enables real-time, FOV-compliant, nonlinear optimal guidance and provides a scalable framework for extending to higher dimensions. $|\,\sigma\,|\leq\sigma_M$ and related PMP conditions underpin the core methodology.
Abstract
An optimal guidance law for impact time control with field-of-view constraint is presented. The guidance law is derived by first converting the inequality-constrained nonlinear optimal control problem into an equality-constrained one through a saturation function. Based on Pontryagin's maximum principle, a parameterized system satisfying the necessary optimality conditions is established. By propagating this system, a large number of extremal trajectories can be efficiently generated. These trajectories are then used to train a neural network that maps the current state and time-to-go to the optimal guidance command. The trained neural network can generate optimal commands within 0.1 milliseconds while satisfying the field-of-view constraint. Numerical simulations demonstrate that the proposed guidance law outperforms existing methods and achieves nearly optimal performance in terms of control effort.
