Sampled-data funnel control and its use for safe continual learning

Lukas Lanza; Dario Dennstädt; Karl Worthmann; Philipp Schmitz; Gökçen Devlet Şen; Stephan Trenn; Manuel Schaller

Sampled-data funnel control and its use for safe continual learning

Lukas Lanza, Dario Dennstädt, Karl Worthmann, Philipp Schmitz, Gökçen Devlet Şen, Stephan Trenn, Manuel Schaller

TL;DR

A novel sampled-data output-feedback controller for nonlinear systems of arbitrary relative degree that ensures reference tracking within prescribed error bounds and capability to serve as a safety filter for various learning-based controller designs, enabling the use of learning techniques in safety-critical applications is proposed.

Abstract

We propose a novel sampled-data output-feedback controller for nonlinear systems of arbitrary relative degree that ensures reference tracking within prescribed error bounds. We provide explicit bounds on the maximum input signal and the required uniform sampling time. A key strength of this approach is its capability to serve as a safety filter for various learning-based controller designs, enabling the use of learning techniques in safety-critical applications. We illustrate its versatility by integrating it with two different controllers: a reinforcement learning controller and a non-parametric predictive controller based on Willems et al.'s fundamental lemma. Numerical simulations illustrate effectiveness of the combined controller design.

Sampled-data funnel control and its use for safe continual learning

TL;DR

Abstract

Paper Structure (12 sections, 7 theorems, 46 equations, 12 figures, 2 algorithms)

This paper contains 12 sections, 7 theorems, 46 equations, 12 figures, 2 algorithms.

Introduction
Control objective, system class, and preliminary results
System class
Auxiliary results
Sampled-data feedback controller
Numerical example: pure ZoH feedback
Safeguarded data-based control
Data-driven MPC using Willems' fundamental lemma
Reinforcement Learning: Q-table control
Nonlinear example: Van der Pol oscillator
Conclusion and future work
Proofs of auxiliary results

Key Result

Lemma 2.1

Let $y_{\mathop{\mathrm{ref}}\limits}\in W^{r,\infty}(\mathds{R}_{\geq0},\mathds{R}^m)$, $\varphi \in \mathcal{G}$, and $y^0\in\mathcal{C}^{r-1}([-\sigma,0],\mathds{R}^m)$ with $\chi(y^0)\in\mathcal{D}_0^r$ be given. Then there exist constants $\varepsilon_k,\mu_k>0$ defined in eq:ve_mu_gam such tha for all $t\in [0,\delta)$ and for all $k=1,\ldots,r-1$.

Figures (12)

Figure 1: Error evolution in a funnel $\mathcal{F}_{\varphi}$ with boundary $1/\varphi(t)$; the figure is based on BergLe18a, edited for present purpose.
Figure 2: Mass-on-car system. The figure is based on SeifBlaj13BergIlch21.
Figure 3: Outputs, reference, and error tolerance.
Figure 4: Controls.
Figure 5: Schematic structure of the combined controller.
...and 7 more figures

Theorems & Definitions (18)

Definition 2.1
Definition 2.2
Lemma 2.1
Lemma 2.2
Theorem 3.1
Remark 3.1
Remark 3.2
Theorem 5.1
proof
Remark 5.1
...and 8 more

Sampled-data funnel control and its use for safe continual learning

TL;DR

Abstract

Sampled-data funnel control and its use for safe continual learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (12)

Theorems & Definitions (18)