Reinforcement Learning-based Control of Nonlinear Systems using Carleman Approximation: Structured and Unstructured Designs

Jishnudeep Kar; He Bai; Aranya Chakrabortty

Reinforcement Learning-based Control of Nonlinear Systems using Carleman Approximation: Structured and Unstructured Designs

Jishnudeep Kar, He Bai, Aranya Chakrabortty

TL;DR

This work introduces reinforcement learning for unknown nonlinear, input-affine systems by embedding the dynamics into an infinite-dimensional Carleman space, yielding a bilinear lifted model suitable for policy-iteration RL. It develops both on-policy and off-policy learning algorithms, derives a Lyapunov-based stability framework, and analyzes the impact of finite-N truncation with explicit truncation-error bounds. To address practical constraints, the authors extend the framework to structured and sparse controllers using Riccati-like equations in Carleman space and ADMM-based sparsity promotion, respectively, while preserving closed-loop stability. Numerical experiments on a second-order oscillator and a four-boat tug network illustrate near-optimal performance, faster learning than NN-based approaches, and clear trade-offs between structure, sparsity, and control performance. The proposed framework offers a scalable, data-driven, stability-guaranteed path to nonlinear RL control with tunable complexity via truncation order.

Abstract

We develop data-driven reinforcement learning (RL) control designs for input-affine nonlinear systems. We use Carleman linearization to express the state-space representation of the nonlinear dynamical model in the Carleman space, and develop a real-time algorithm that can learn nonlinear state-feedback controllers using state and input measurements in the infinite-dimensional Carleman space. Thereafter, we study the practicality of having a finite-order truncation of the control signal, followed by its closed-loop stability analysis. Finally, we develop two additional designs that can learn structured as well as sparse representations of the RL-based nonlinear controller, and provide theoretical conditions for ensuring their closed-loop stability. We present numerical examples to show how our proposed method generates closed-loop responses that are close to the optimal performance of the nonlinear plant. We also compare our designs to other data-driven nonlinear RL control methods such as those based on neural networks, and illustrate their relative advantages and drawbacks.

Reinforcement Learning-based Control of Nonlinear Systems using Carleman Approximation: Structured and Unstructured Designs

TL;DR

Abstract

Paper Structure (31 sections, 12 theorems, 108 equations, 9 figures, 2 tables, 2 algorithms)

This paper contains 31 sections, 12 theorems, 108 equations, 9 figures, 2 tables, 2 algorithms.

INTRODUCTION
Problem Formulation
Carleman Linearization
State-space representation in Carleman space
Derivation of Lyapunov equation
RL Control for Lifted System
Policy iteration for Carleman systems
Theoretical Results
Convergence
$\mathbf{N}^{th}$ order truncation
LQR control objective and policy iteration
Reinforcement learning for the truncated Carleman model
Stability analysis for truncated approximation
Stability analysis for infinite-dimensional system
Online implementation
...and 16 more sections

Key Result

Theorem 1

Given that $(\mathcal{A},Q^{1/2})$ is controllable, if $K$ is the optimal controller minimizing the objective function eqn:obj for the system eqn:infcarl, then $K$ must satisfy the Lyapunov-like infinite-dimensional equation for a symmetric matrix $P$ given by where $\mathcal{A}_{cl} \psi = \mathcal{A} \psi - B_{\psi} K\psi$.

Figures (9)

Figure 1: Flowchart for stability proof of truncated controller
Figure 2: Learning of the $2^{nd}$ order RL controller
Figure 3: Comparison of learnt controller with HJB solution
Figure 4: Learning and control for $\mathbf{N}=2$ Carleman controller
Figure 5: Convergence for varying $Q$
...and 4 more figures

Theorems & Definitions (25)

proof
Theorem 1
proof
Theorem 2
proof
Theorem 3
proof
Lemma 4
proof
Theorem 5
...and 15 more

Reinforcement Learning-based Control of Nonlinear Systems using Carleman Approximation: Structured and Unstructured Designs

TL;DR

Abstract

Reinforcement Learning-based Control of Nonlinear Systems using Carleman Approximation: Structured and Unstructured Designs

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (25)