Table of Contents
Fetching ...

Vector-valued self-normalized concentration inequalities beyond sub-Gaussianity

Diego Martinez-Taboada, Tomas Gonzalez, Aaditya Ramdas

TL;DR

The paper develops time-uniform concentration inequalities for vector-valued self-normalized processes in separable Hilbert spaces beyond sub-Gaussian tails by introducing a Pinelis-based nonnegative supermartingale that decouples direction from noise. It yields Bernstein- and Bennett-type bounds, including sequential and empirical variants, and ties the pseudo-variance to the information gain via the elliptical potential lemma, enabling dimension-free results. These inequalities are then translated into online inference tools for online linear regression and kernelized linear bandits, providing confidence ellipsoids and regret bounds that adapt to light-tailed noises and variance structure. The work broadens applicability of self-normalized concentration in infinite-dimensional settings and offers practical benefit for sequential decision-making under non-sub-Gaussian noise.

Abstract

The study of self-normalized processes plays a crucial role in a wide range of applications, from sequential decision-making to econometrics. While the behavior of self-normalized concentration has been widely investigated for scalar-valued processes, vector-valued processes remain comparatively underexplored, especially outside of the sub-Gaussian framework. In this contribution, we provide concentration bounds for self-normalized processes with light tails beyond sub-Gaussianity (such as Bennett or Bernstein bounds). We illustrate the relevance of our results in the context of online linear regression, with applications in (kernelized) linear bandits.

Vector-valued self-normalized concentration inequalities beyond sub-Gaussianity

TL;DR

The paper develops time-uniform concentration inequalities for vector-valued self-normalized processes in separable Hilbert spaces beyond sub-Gaussian tails by introducing a Pinelis-based nonnegative supermartingale that decouples direction from noise. It yields Bernstein- and Bennett-type bounds, including sequential and empirical variants, and ties the pseudo-variance to the information gain via the elliptical potential lemma, enabling dimension-free results. These inequalities are then translated into online inference tools for online linear regression and kernelized linear bandits, providing confidence ellipsoids and regret bounds that adapt to light-tailed noises and variance structure. The work broadens applicability of self-normalized concentration in infinite-dimensional settings and offers practical benefit for sequential decision-making under non-sub-Gaussian noise.

Abstract

The study of self-normalized processes plays a crucial role in a wide range of applications, from sequential decision-making to econometrics. While the behavior of self-normalized concentration has been widely investigated for scalar-valued processes, vector-valued processes remain comparatively underexplored, especially outside of the sub-Gaussian framework. In this contribution, we provide concentration bounds for self-normalized processes with light tails beyond sub-Gaussianity (such as Bennett or Bernstein bounds). We illustrate the relevance of our results in the context of online linear regression, with applications in (kernelized) linear bandits.

Paper Structure

This paper contains 35 sections, 9 theorems, 95 equations, 1 figure.

Key Result

Theorem 1

Let $(X_t)_{t \geq 1}$ and $(\epsilon_t)_{t \geq 1}$ be Hilbert space valued and real valued processes, respectively, attaining Assumption assumption:adapted. Let $\lambda>0$ and recall that $G_t = \left( \rho I + V_t \right)^{-\frac{1}{2}} X_t$. Denoting the process is a nonnegative supermartingale.

Figures (1)

  • Figure 1: Illustration of the optimistic upper confidence bounds for the regression function after $500$ rounds using sub-Gaussian, mixed Bennett, and empirical mixed Bennett inequalities for noises following (I) a rescaled uniform distribution, (II) a rescaled ($5, 5$)-beta distribution, (III) a rescaled ($20, 20$)-beta distribution, and (IV) a rescaled ($50, 50$)-beta distribution. Training points are drawn following a UCB procedure, with the covariates $X_t = k(\cdot, \tilde{X}_t)$ illustrated in the original space (pre-embedded in the RKHS).

Theorems & Definitions (10)

  • Theorem 1
  • proof
  • Proposition 1: Light-tailed self-normalized process concentration inequality
  • Corollary 1
  • Theorem 2: Bernstein-type concentration inequality
  • Theorem 3: Bennett-type concentration inequality
  • Theorem 4: Mixed Bennett-type concentration inequality
  • Theorem 5: Mixed empirical Bennett-type concentration bound
  • Corollary 2
  • Corollary 3