Table of Contents
Fetching ...

Almost Optimal Agnostic Control of Unknown Linear Dynamics

Jacob Carruth, Maximilian F. Eggl, Charles Fefferman, Clarence W. Rowley

Abstract

We consider a simple control problem in which the underlying dynamics depend on a parameter $a$ that is unknown and must be learned. We study three variants of the control problem: Bayesian control, in which we have a prior belief about $a$; bounded agnostic control, in which we have no prior belief about $a$ but we assume that $a$ belongs to a bounded set; and fully agnostic control, in which $a$ is allowed to be an arbitrary real number about which we have no prior belief. In the Bayesian variant, a control strategy is optimal if it minimizes a certain expected cost. In the agnostic variants, a control strategy is optimal if it minimizes a quantity called the worst-case regret. For the Bayesian and bounded agnostic variants above, we produce optimal control strategies. For the fully agnostic variant, we produce almost optimal control strategies, i.e., for any $\varepsilon>0$ we produce a strategy that minimizes the worst-case regret to within a multiplicative factor of $(1+\varepsilon)$.

Almost Optimal Agnostic Control of Unknown Linear Dynamics

Abstract

We consider a simple control problem in which the underlying dynamics depend on a parameter that is unknown and must be learned. We study three variants of the control problem: Bayesian control, in which we have a prior belief about ; bounded agnostic control, in which we have no prior belief about but we assume that belongs to a bounded set; and fully agnostic control, in which is allowed to be an arbitrary real number about which we have no prior belief. In the Bayesian variant, a control strategy is optimal if it minimizes a certain expected cost. In the agnostic variants, a control strategy is optimal if it minimizes a quantity called the worst-case regret. For the Bayesian and bounded agnostic variants above, we produce optimal control strategies. For the fully agnostic variant, we produce almost optimal control strategies, i.e., for any we produce a strategy that minimizes the worst-case regret to within a multiplicative factor of .
Paper Structure (6 sections, 5 theorems, 56 equations)

This paper contains 6 sections, 5 theorems, 56 equations.

Key Result

Theorem 1

Fix a probability distribution $d\emph{Prior}$, supported on $[-a_\emph{max}, a_\emph{max}]$, and suppose our PDE Assumption is satisfied. Let $\sigma = \sigma_{\emph{Bayes}}(d\emph{Prior})$ be the strategy obtained by solving eq: 11--eq: 15. Then

Theorems & Definitions (5)

  • Theorem 1: Optimal Bayesian Strategy
  • Theorem 2: Quantitative Uniqueness of the Optimal Bayesian Strategy
  • Theorem 3
  • Lemma 1: Agnostic Control Lemma
  • Theorem 4