Formal Entropy-Regularized Control of Stochastic Systems

Menno van Zutphen; Giannis Delimpaltadakis; Duarte J. Antunes

Formal Entropy-Regularized Control of Stochastic Systems

Menno van Zutphen, Giannis Delimpaltadakis, Duarte J. Antunes

TL;DR

This work obtains bounds on the entropy of system discretizations using traditional formal-abstractions results, and derives an additional bound on the difference between the entropy of a continuous distribution and that of its discretization, which enables formal entropy-aware controller synthesis that trades predictability against control performance while preserving formal guarantees for the original continuous system.

Abstract

Analyzing and controlling system entropy is a powerful tool for regulating predictability of control systems. Applications benefiting from such approaches range from reinforcement learning and data security to human-robot collaboration. In continuous-state stochastic systems, accurate entropy analysis and control remains a challenge. In recent years, finite-state abstractions of continuous systems have enabled control synthesis with formal performance guarantees on objectives such as stage costs. However, these results do not extend to entropy-based performance measures. We solve this problem by first obtaining bounds on the entropy of system discretizations using traditional formal-abstractions results, and then obtaining an additional bound on the difference between the entropy of a continuous distribution and that of its discretization. The resulting theory enables formal entropy-aware controller synthesis that trades predictability against control performance while preserving formal guarantees for the original continuous system. More specifically, we focus on minimizing the linear combination of the KL divergence of the system trajectory distribution to uniform -- our system entropy metric -- and a generic cumulative cost. We note that the bound we derive on the difference between the KL divergence to uniform of a given continuous distribution and its discretization can also be relevant in more general information-theoretic contexts. A set of case studies illustrates the effectiveness of the method.

Formal Entropy-Regularized Control of Stochastic Systems

TL;DR

Abstract

Paper Structure (25 sections, 13 theorems, 107 equations, 5 figures, 2 algorithms)

This paper contains 25 sections, 13 theorems, 107 equations, 5 figures, 2 algorithms.

Introduction
Markov chains and entropy
Markov chains
Trajectory distribution discretization
Interval Markov chain abstraction
Entropy quantification
Formally Bounding System Entropy
Bound construction
Considerations and algorithms
Formal Entropy-Regularized Control
Basic preliminaries
Formal entropy-regularized bound optimization
Examples
Markov Chain Entropy Bounds
Markov Decision Process Controller Synthesis
...and 10 more sections

Key Result

Lemma 1

For a distribution density $T$ defined on a connected compact set $\mathcal{S}\subset\mathbb{R}^{n}$ and its discretization $p_{t}:=\int_{\mathcal{S}_{t}}T(s)~\mathrm{d}s$ over partition $\bigcup_{t\in S} \mathcal{S}_{t} = \mathcal{S}$, $\lambda(\mathcal{S}_{t}\cup \mathcal{S}_{t^{\prime}})=0$ for a where $T_{D}$ is the piece-wise continuous representation of $p$, defined as where $\mathbbm{1}_{\

Figures (5)

Figure 1: Discretization $(\mathcal{S}_t)_{t\in S}$ of the trajectory space $\mathcal{S}$, as induced by a hyperrectangular discretization $(\mathcal{X}_{i})_{i\in X}$ of the state space $\mathcal{X}$.
Figure 2: The objects that form a formal lower bound on the KL divergence to uniform of system $\mathcal{M}$, i.e., $\operatorname{KL}(T\| U)$, our metric total system entropy.
Figure 3: The objects that together form formal upper bounds on the KL divergence to uniform of system $\mathcal{M}$, i.e., $\operatorname{KL}(T\| U)$, our metric total system entropy.
Figure 4: KL divergence to uniform abstraction bounds as a function of discretization resolution as expressed in the number $N$ of equal subdivisions for each dimension.
Figure 5: Behavior of the bumpy-hill AV example system under the unregularized policy $\mu_{\text{DP}}$, the entropy-regularized policy with global $\varepsilon$ correction $\mu$, and the locally-corrected entropy-regularized policy $\mu^{\varepsilon}$.

Theorems & Definitions (16)

Remark 1
Lemma 1: Continuous-Discrete Discrepancy
Theorem 1: Entropy Abstraction Lower Bound
Lemma 2: Discretization Difference Upper-Bound
Theorem 2: Global Discretization Difference Bound
Remark 2
Theorem 3: Local Discretization Difference Bound
Lemma 3: Bounding the magnitude of the gradient $T$
Remark 3
Lemma 4: Derivative of Solution to \ref{['eq:T(c)']}
...and 6 more

Formal Entropy-Regularized Control of Stochastic Systems

TL;DR

Abstract

Formal Entropy-Regularized Control of Stochastic Systems

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (16)