Almost Optimal Agnostic Control of Unknown Linear Dynamics

Jacob Carruth; Maximilian F. Eggl; Charles Fefferman; Clarence W. Rowley

Almost Optimal Agnostic Control of Unknown Linear Dynamics

Jacob Carruth, Maximilian F. Eggl, Charles Fefferman, Clarence W. Rowley

Abstract

We consider a simple control problem in which the underlying dynamics depend on a parameter $a$ that is unknown and must be learned. We study three variants of the control problem: Bayesian control, in which we have a prior belief about $a$; bounded agnostic control, in which we have no prior belief about $a$ but we assume that $a$ belongs to a bounded set; and fully agnostic control, in which $a$ is allowed to be an arbitrary real number about which we have no prior belief. In the Bayesian variant, a control strategy is optimal if it minimizes a certain expected cost. In the agnostic variants, a control strategy is optimal if it minimizes a quantity called the worst-case regret. For the Bayesian and bounded agnostic variants above, we produce optimal control strategies. For the fully agnostic variant, we produce almost optimal control strategies, i.e., for any $\varepsilon>0$ we produce a strategy that minimizes the worst-case regret to within a multiplicative factor of $(1+\varepsilon)$.

Almost Optimal Agnostic Control of Unknown Linear Dynamics

Abstract

We consider a simple control problem in which the underlying dynamics depend on a parameter

that is unknown and must be learned. We study three variants of the control problem: Bayesian control, in which we have a prior belief about

; bounded agnostic control, in which we have no prior belief about

but we assume that

belongs to a bounded set; and fully agnostic control, in which

is allowed to be an arbitrary real number about which we have no prior belief. In the Bayesian variant, a control strategy is optimal if it minimizes a certain expected cost. In the agnostic variants, a control strategy is optimal if it minimizes a quantity called the worst-case regret. For the Bayesian and bounded agnostic variants above, we produce optimal control strategies. For the fully agnostic variant, we produce almost optimal control strategies, i.e., for any

we produce a strategy that minimizes the worst-case regret to within a multiplicative factor of

Paper Structure (6 sections, 5 theorems, 56 equations)

This paper contains 6 sections, 5 theorems, 56 equations.

The Model System
A Future Direction

Key Result

Theorem 1

Fix a probability distribution $d\emph{Prior}$, supported on $[-a_\emph{max}, a_\emph{max}]$, and suppose our PDE Assumption is satisfied. Let $\sigma = \sigma_{\emph{Bayes}}(d\emph{Prior})$ be the strategy obtained by solving eq: 11--eq: 15. Then

Theorems & Definitions (5)

Theorem 1: Optimal Bayesian Strategy
Theorem 2: Quantitative Uniqueness of the Optimal Bayesian Strategy
Theorem 3
Lemma 1: Agnostic Control Lemma
Theorem 4

Almost Optimal Agnostic Control of Unknown Linear Dynamics

Abstract

Almost Optimal Agnostic Control of Unknown Linear Dynamics

Authors

Abstract

Table of Contents

Key Result

Theorems & Definitions (5)