3DIOC: Direct Data-Driven Inverse Optimal Control for LTI Systems
Chendi Qu, Jianping He, Xiaoming Duan
TL;DR
This work tackles inverse optimal control for linear time-invariant systems by learning the finite-horizon LQ objective weights $Q$ and $R$ directly from input-output trajectories, without identifying the system dynamics. It builds a data-enabled, model-free IOC framework using the Fundamental Lemma to obtain an input-output representation and derives model-free KKT conditions that connect data blocks to the unknown $Q$ and $R$. The authors propose Vanilla and Simplified 3DIOC formulations with identifiability and perturbation analyses, including a special LQR-IOC case, and demonstrate computational efficiency and robustness through simulations. The approach reduces data and computation while providing guarantees on identifiability and sensitivity to noise, with potential extensions to process-noise and Koopman-based nonlinear settings.
Abstract
This paper develops a direct data-driven inverse optimal control (3DIOC) algorithm for the linear time-invariant (LTI) system who conducts a linear quadratic (LQ) control, where the underlying objective function is learned directly from measured input-output trajectories without system identification. By introducing the Fundamental Lemma, we establish the input-output representation of the LTI system. We accordingly propose a model-free optimality necessary condition for the forward LQ problem to build a connection between the objective function and collected data, with which the inverse optimal control problem is solved. We further improve the algorithm so that it requires a less computation and data. Identifiability condition and perturbation analysis are provided. Simulations demonstrate the efficiency and performance of our algorithms.
