Table of Contents
Fetching ...

Towards a Banach Space Chernoff Bound for Markov Chains via Chaining Arguments

Shravas Rao

Abstract

Let $\{Y_i\}_{i=1}^{\infty}$ be a stationary reversible Markov chain with state space $[N]$, let $(X, \| \cdot \|)$ be a real-valued Banach space and let $f_1, \ldots, f_n: [N] \rightarrow X$ be functions with mean $0$ such that $\|f_i(v)\| \leq 1$ for all $i$ and $v$. We prove bounds on the expected value of and deviation bounds for the random variable $\|f_1(Y_1)+\cdots+f_n(Y_n)\|$. For large enough $n$ that depends on the Banach space (and not $N$), these bounds behave similarly as known bounds for independent random variables. When the Banach space in question is the set of matrices equipped with the $\ell_2 \rightarrow \ell_2$ operator norm, for large enough $n$, our bounds on the expected value improve upon known bounds and match what is known for independent random variables up to a factor in the spectral gap.

Towards a Banach Space Chernoff Bound for Markov Chains via Chaining Arguments

Abstract

Let be a stationary reversible Markov chain with state space , let be a real-valued Banach space and let be functions with mean such that for all and . We prove bounds on the expected value of and deviation bounds for the random variable . For large enough that depends on the Banach space (and not ), these bounds behave similarly as known bounds for independent random variables. When the Banach space in question is the set of matrices equipped with the operator norm, for large enough , our bounds on the expected value improve upon known bounds and match what is known for independent random variables up to a factor in the spectral gap.

Paper Structure

This paper contains 7 sections, 10 theorems, 46 equations.

Key Result

Theorem 1.2

Let $(X, \|\cdot\|)$ be a Banach space so that $\rho_X(\tau) \leqslant s \tau^2$ for some $s$ and all $\tau > 0$. Let $f_1, \ldots, f_n: [N] \rightarrow X$ be functions such that $\mathbb{E}[f(Y_i)] = 0$ for all $i$ and $\|f_i(v)\| \leqslant 1$ for all $i$ and $v \in [N]$. Let $Y_1, \ldots, Y_n$ be for some universal constant $C$.

Theorems & Definitions (16)

  • Definition 1.1
  • Theorem 1.2: Theorem, N12
  • Theorem 1.3
  • Theorem 1.4
  • Theorem 1.5
  • Definition 2.1
  • Theorem 2.2: Theorem 4.5.13 in T21
  • Theorem 2.3: Talagrand's majorizing measures theorem
  • Theorem 3.1: Theorem 3.3 in P15
  • Corollary 3.2
  • ...and 6 more