Table of Contents
Fetching ...

Flexible-step MPC for Unknown Linear Time-Invariant Systems

Markus Pietschner, Christian Ebenbauer, Bahman Gharesifard, Raik Suttner

TL;DR

The paper tackles stabilization of unknown discrete-time LTI systems using MPC without prior data. It introduces a flexible-step MPC framework that applies a variable number of optimized inputs per iteration and enforces stability with a generalized discrete-time control Lyapunov function (GDCLF) inside the OCP, effectively decoupling optimization from stabilization. For unknown systems, online data-driven estimates of $(A,B)$ are maintained and updated through a data list and a PE-based exploration mechanism, with convergence guarantees showing that, under feasibility, the state converges to the origin. Simulations on a hard-to-learn system demonstrate fast exponential convergence and robustness to measurement noise, achieving stabilization with minimal exploratory disturbance. The approach offers a practical, data-driven path to stabilize unknown LTI systems and suggests avenues for extensions to nonlinear or structured-model settings.

Abstract

We propose a novel flexible-step model predictive control algorithm for unknown linear time-invariant discrete-time systems. The goal is to asymptotically stabilize the system without relying on a pre-collected dataset that describes its behavior in advance. In particular, we aim to avoid a potentially harmful initial open-loop exploration phase for identification, since full identification is often not necessary for stabilization. Instead, the proposed control scheme explores and learns the unknown system online through measurements of inputs and states. The measurement results are used to update the prediction model in the finite-horizon optimal control problem. If the current prediction model yields an infeasible optimal control problem, then persistently exciting inputs are applied until feasibility is reestablished. The proposed flexible-step approach allows for a flexible number of implemented optimal input values in each iteration, which is beneficial for simultaneous exploration and exploitation. A generalized control Lyapunov function is included into the constraints of the optimal control problem to enforce stability. This way, the problem of optimization is decoupled from the problem of stabilization. For an asymptotically stabilizable unknown control system, we prove that the proposed flexible-step algorithm can lead to global convergence of the system state to the origin.

Flexible-step MPC for Unknown Linear Time-Invariant Systems

TL;DR

The paper tackles stabilization of unknown discrete-time LTI systems using MPC without prior data. It introduces a flexible-step MPC framework that applies a variable number of optimized inputs per iteration and enforces stability with a generalized discrete-time control Lyapunov function (GDCLF) inside the OCP, effectively decoupling optimization from stabilization. For unknown systems, online data-driven estimates of are maintained and updated through a data list and a PE-based exploration mechanism, with convergence guarantees showing that, under feasibility, the state converges to the origin. Simulations on a hard-to-learn system demonstrate fast exponential convergence and robustness to measurement noise, achieving stabilization with minimal exploratory disturbance. The approach offers a practical, data-driven path to stabilize unknown LTI systems and suggests avenues for extensions to nonlinear or structured-model settings.

Abstract

We propose a novel flexible-step model predictive control algorithm for unknown linear time-invariant discrete-time systems. The goal is to asymptotically stabilize the system without relying on a pre-collected dataset that describes its behavior in advance. In particular, we aim to avoid a potentially harmful initial open-loop exploration phase for identification, since full identification is often not necessary for stabilization. Instead, the proposed control scheme explores and learns the unknown system online through measurements of inputs and states. The measurement results are used to update the prediction model in the finite-horizon optimal control problem. If the current prediction model yields an infeasible optimal control problem, then persistently exciting inputs are applied until feasibility is reestablished. The proposed flexible-step approach allows for a flexible number of implemented optimal input values in each iteration, which is beneficial for simultaneous exploration and exploitation. A generalized control Lyapunov function is included into the constraints of the optimal control problem to enforce stability. This way, the problem of optimization is decoupled from the problem of stabilization. For an asymptotically stabilizable unknown control system, we prove that the proposed flexible-step algorithm can lead to global convergence of the system state to the origin.

Paper Structure

This paper contains 9 sections, 3 theorems, 24 equations, 1 figure, 2 algorithms.

Key Result

Theorem 1

Suppose ass:feasibility is satisfied. Then, system eq:unknownSystem under algo:KLTI is globally exponentially stable.

Figures (1)

  • Figure 1: Simulation results for an unknown LTI system under flexible-step MPC. Upper plot: Euclidean norm of the system state as a function of time. Lower plot: Deviation of the prediction model from the unknown system. The blue dots represent the results generated by \ref{['algo:ULTI']}. For comparison, the orange dots represent the results for the same choice of parameters, but with an additional exploration of the unknown system by persistently exciting (PE) inputs prior to the application of \ref{['algo:ULTI']}. The pre-exploration by PE inputs lasts from the initial time $t=0$ until $t=15$. A direct application of \ref{['algo:ULTI']} without pre-exploration (blue dots) only involves one PE input value at time $t=1$. For further comparison, the green dots represent the results generated by \ref{['algo:ULTI']} for the same choice of parameters but under the additional influence of measurement noise. Again, only one PE input at time $t=1$ is applied.

Theorems & Definitions (15)

  • Remark 1
  • Remark 2
  • Remark 3: Choice of Parameters
  • Theorem 1
  • proof
  • Definition 1
  • Remark 4
  • Example 1
  • Definition 2
  • Example 2
  • ...and 5 more