Table of Contents
Fetching ...

Learning-Based Model Predictive Control for Piecewise Affine Systems with Feasibility Guarantees

Samuel Mallick, Azita Dabiri, Bart De Schutter

TL;DR

This work tackles online MPC for piecewise affine systems, where solving for the switching sequence can be computationally prohibitive. It introduces a learning-based MPC that offline trains a classifier to map states to a PWA region sequence, reducing online operation to a linear program while guaranteeing feasibility of the policy output. A convex-partition-based structure, an iterative data-generation procedure, and a tightened-MPC feasibility test underpin the approach, yielding substantial offline and online efficiency gains relative to explicit and online MPC. Numerical results show the method achieves favorable suboptimality concentrated near partition boundaries and significantly lower online computation, enabling scalable real-time control for PWA dynamics with feasibility guarantees.

Abstract

Online model predictive control (MPC) for piecewise affine (PWA) systems requires the online solution to an optimization problem that implicitly optimizes over the switching sequence of PWA regions, for which the computational burden can be prohibitive. Alternatively, the computation can be moved offline using explicit MPC; however, the online memory requirements and the offline computation can then become excessive. In this work we propose a solution in between online and explicit MPC, addressing the above issues by partially dividing the computation between online and offline. To solve the underlying MPC problem, a policy, learned offline, specifies the sequence of PWA regions that the dynamics must follow, thus reducing the complexity of the remaining optimization problem that solves over only the continuous states and control inputs. We provide a condition, verifiable during learning, that guarantees feasibility of the learned policy's output, such that an optimal continuous control input can always be found online. Furthermore, a method for iteratively generating training data offline allows the feasible policy to be learned efficiently, reducing the offline computational burden. A numerical experiment demonstrates the effectiveness of the method compared to both online and explicit MPC.

Learning-Based Model Predictive Control for Piecewise Affine Systems with Feasibility Guarantees

TL;DR

This work tackles online MPC for piecewise affine systems, where solving for the switching sequence can be computationally prohibitive. It introduces a learning-based MPC that offline trains a classifier to map states to a PWA region sequence, reducing online operation to a linear program while guaranteeing feasibility of the policy output. A convex-partition-based structure, an iterative data-generation procedure, and a tightened-MPC feasibility test underpin the approach, yielding substantial offline and online efficiency gains relative to explicit and online MPC. Numerical results show the method achieves favorable suboptimality concentrated near partition boundaries and significantly lower online computation, enabling scalable real-time control for PWA dynamics with feasibility guarantees.

Abstract

Online model predictive control (MPC) for piecewise affine (PWA) systems requires the online solution to an optimization problem that implicitly optimizes over the switching sequence of PWA regions, for which the computational burden can be prohibitive. Alternatively, the computation can be moved offline using explicit MPC; however, the online memory requirements and the offline computation can then become excessive. In this work we propose a solution in between online and explicit MPC, addressing the above issues by partially dividing the computation between online and offline. To solve the underlying MPC problem, a policy, learned offline, specifies the sequence of PWA regions that the dynamics must follow, thus reducing the complexity of the remaining optimization problem that solves over only the continuous states and control inputs. We provide a condition, verifiable during learning, that guarantees feasibility of the learned policy's output, such that an optimal continuous control input can always be found online. Furthermore, a method for iteratively generating training data offline allows the feasible policy to be learned efficiently, reducing the offline computational burden. A numerical experiment demonstrates the effectiveness of the method compared to both online and explicit MPC.

Paper Structure

This paper contains 13 sections, 4 theorems, 9 equations, 3 figures, 3 tables, 2 algorithms.

Key Result

Proposition 1

Consider the problem $\bm{\delta}^\ast(x) \in \mathop{\mathrm{arg\,min}}\limits_{\bm{\delta}} J(x, \bm{\delta})$ for $x \in \mathcal{X}_0$. Define $\mathcal{L}_i = \{\bm{\delta} \in \mathcal{Z}_{[0, l]}^{N+1} | \delta(0) = i\}$. Then

Figures (3)

  • Figure 1: Illustration of the proof of Proposition \ref{['prop:1']}. $\mathcal{R}_{1, 1}$ and $\mathcal{R}_{1, 2}$ are associated with $\bm{\delta}^\ast(x) = \bm{\delta}_a$, $\mathcal{R}_{1, 3}$ is associated with $\bm{\delta}^\ast(x) = \bm{\delta}_b$, and $\mathcal{R}_{1, 4}$ and $\mathcal{R}_{1, 5}$ are associated with $\bm{\delta}^\ast(x) = \bm{\delta}_c$.
  • Figure 2: Training for $N=12$. Each color represents a different $\bm{\delta}$, while the dark gray region represents infeasibility.
  • Figure 3: Open-loop suboptimality $\Delta J$ for $N=12$.

Theorems & Definitions (10)

  • Proposition 1
  • proof
  • Proposition 2
  • proof
  • Remark 1
  • Proposition 3
  • proof
  • Remark 2
  • Proposition 4
  • proof