A Reduced Order Iterative Linear Quadratic Regulator (ILQR) Technique for the Optimal Control of Nonlinear Partial Differential Equations
Aayushman Sharma, Suman Chakravorty
TL;DR
The paper tackles optimal control of nonlinear PDEs by integrating POD-based model order reduction with ILQR in an iterative reduced-order loop (RO-ILQR). It formulates a reduced, time-varying linear model around the current trajectory and solves a time-varying reduced-order LQR to update the trajectory and basis, repeating until convergence. The authors provide a convergence analysis showing the method approaches a limit set determined by truncation error and demonstrate significant computational savings on Burgers and phase-field PDEs with competitive performance relative to full ILQR. Empirical results indicateRO-ILQR reduces dimensionality from n_x+n_u to n_α+n_u, with the open-loop cost within about 14% of the true optimum and notable speedups, while outperforming a deep RL baseline that struggles to converge on these problems.
Abstract
In this paper, we introduce a reduced order model-based reinforcement learning (MBRL) approach, utilizing the Iterative Linear Quadratic Regulator (ILQR) algorithm for the optimal control of nonlinear partial differential equations (PDEs). The approach proposes a novel modification of the ILQR technique: it uses the Method of Snapshots to identify a reduced order Linear Time Varying (LTV) approximation of the nonlinear PDE dynamics around a current estimate of the optimal trajectory, utilizes the identified LTV model to solve a time-varying reduced order LQR problem to obtain an improved estimate of the optimal trajectory along with a new reduced basis, and iterates till convergence. The convergence behavior of the reduced order approach is analyzed and the algorithm is shown to converge to a limit set that is dependent on the truncation error in the reduction. The proposed approach is tested on the viscous Burger's equation and two phase-field models for microstructure evolution in materials, and the results show that there is a significant reduction in the computational burden over the standard ILQR approach, without significantly sacrificing performance.
