Robust Direct Data-Driven Control for Probabilistic Systems

Alexander von Rohr; Dmitrii Likhachev; Sebastian Trimpe

Robust Direct Data-Driven Control for Probabilistic Systems

Alexander von Rohr, Dmitrii Likhachev, Sebastian Trimpe

TL;DR

This work derives lower bounds on the amount of data required to achieve quadratic stability for probabilistic systems with aleatoric uncertainty, and finds that the learned controllers generalize well to high variations in the dynamics even when based on only a few short open-loop trajectories.

Abstract

We propose a data-driven control method for systems with aleatoric uncertainty, for example, robot fleets with variations between agents. Our method leverages shared trajectory data to increase the robustness of the designed controller and thus facilitate transfer to new variations without the need for prior parameter and uncertainty estimations. In contrast to existing work on experience transfer for performance, our approach focuses on robustness and uses data collected from multiple realizations to guarantee generalization to unseen ones. Our method is based on scenario optimization combined with recent formulations for direct data-driven control. We derive lower bounds on the amount of data required to achieve quadratic stability for probabilistic systems with aleatoric uncertainty and demonstrate the benefits of our data-driven method through a numerical example. We find that the learned controllers generalize well to high variations in the dynamics even when based on only a few short open-loop trajectories. Robust experience transfer enables the design of safe and robust controllers that work out of the box without any additional learning during deployment.

Robust Direct Data-Driven Control for Probabilistic Systems

TL;DR

Abstract

Paper Structure (13 sections, 3 theorems, 15 equations, 5 figures)

This paper contains 13 sections, 3 theorems, 15 equations, 5 figures.

Introduction
Related Work
Problem Formulation
Preliminaries
Data Informativity in Direct Data-Driven Control
Robust State Feedback
Stabilization for Finite Sets of Systems
Stabilization of Probabilistic Systems
Numerical Example
Probabilistic Linear System
Generalization and Feasibility of the Synthesis
Effect of the Trajectory Length
Conclusion

Key Result

Lemma 1

Assume the generalized Slater condition eq:gen_slater holds. Then the data $\tau_i$ is informative for quadratic stabilization if and only if there exists $P \in \mathbb{R}^{ d \times d }$ with $P = P^\top \succ 0$, $L \in \mathbb{R}^{ d \times d }$ and scalars $a \geq 0$, $b > 0$ satisfying If $P$ and $L$ satisfy eq:lmistab, then $K := L P^{-1}$ is a stabilizing feedback controller for all $\the

Figures (5)

Figure 1: Sketch of the proposed method: A controller is based on a set of collected trajectory data from $N$ systems with variations (here symbolized as quadcopters). The resulting controller is guaranteed to work on any unseen system drawn from the same probability distribution with high probability.
Figure 2: Illustrative example of scenario optimization for a one-dimensional probabilistic system: Resulting controller robustly stabilizes the union of the sets $\Sigma_{\tau_{1}}\dots \Sigma_{\tau_{4}}$. The more samples are available, the more likely it is that the controller stabilizes a sample from $\Theta$ (cf. Theorem \ref{['th:scenario_rddc']}). The fraction of $\Theta$ that remains unstable corresponds to $\alpha$.
Figure 3: Synthetic example with different number of observed systems and increasing variance of the probabilistic system: The probability that the controller stabilizes a sample from the probabilistic system increases as more samples become available. For high variance the uncertainty set becomes too large and the synthesis returns no controller ($\times$). The frequency of stability is averaged over $50$ different data sets and the size of the dots indicates the average frequency at which the data is informative.
Figure 4: Synthetic example with different number of observed systems and varying trajectory length ($\sigma^2=0.1$): The robustness is largely unaffected by the trajectory length but longer trajectories increase data informativity and the method is able to find controllers more often.
Figure 5: Uncertainty sets for varying trajectory lengths: The columns represent rollouts with different noise realizations. For short trajectories, the form, the position, and the size of the uncertainty set vary significantly across different rollouts. Longer trajectories provide more consistent uncertainty sets.

Theorems & Definitions (11)

Definition 1: Probabilistic linear system
Definition 2: $\alpha$-probabilistic robust controller
Definition 3: Generalized Slater's condition
Remark 1
Lemma 1: waarde2022noisy
Definition 4: Informativity for quadratic stabilization of finite sets of systems
Theorem 1
proof
Definition 5: Informativity for $\alpha$-probabilistic quadratic stabilization
Theorem 2
...and 1 more

Robust Direct Data-Driven Control for Probabilistic Systems

TL;DR

Abstract

Robust Direct Data-Driven Control for Probabilistic Systems

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (11)