Online Learning-Based Predictive Control for Nonlinear System
Yuanqing Zhang, Huanshui Zhang
TL;DR
The paper addresses predictive control for nonlinear systems without explicit dynamics by fusing reinforcement learning with model-free online optimization. An RL-based LPC solver is embedded within an OCP framework to deliver real-time, Hessian-free updates and guarantee super-linear convergence, even when Hessians are singular. The approach is validated across linear, nonlinear, and trajectory-tracking tasks, demonstrating stability, faster convergence, and practical online feasibility with runtimes around a few hundredths of a second per horizon. This data-driven, Hessian-robust LPC offers a scalable alternative to traditional MPC when models are unavailable or costly to identify, with clear advantages in computation efficiency and robustness.
Abstract
In this paper, we propose an online learning-based predictive control (LPC) approach designed for nonlinear systems that lack explicit system dynamics. Unlike traditional model predictive control (MPC) algorithms that rely on known system models to optimize controller outputs, our proposed algorithm integrates a reinforcement learning component to learn optimal policies in real time from the offline dataset and real-time data. Additionally, an optimal control problem (OCP)-based optimization framework is incorporated to enhance real-time computational efficiency while ensuring stability during online operation. Moreover, we rigorously establish the super-linear convergence properties of the algorithm. Finally, extensive simulations are performed to evaluate the feasibility and effectiveness of the proposed approach.
