A Vector Bernstein Inequality for Self-Normalized Martingales
Ingvar Ziemann
TL;DR
The paper derives a Bernstein-type bound for vector-valued self-normalized martingales by reinterpreting the classical pseudo-maximization approach through the PAC-Bayesian inequality. It replaces Gaussian priors with uniform ellipsoidal priors to obtain analytically tractable, variance-sensitive bounds that depend on the true variance proxy $\sigma_{\mathrm{var}}^2$ rather than a sub-Gaussian surrogate. The main result provides a high-probability bound on $\|S_\tau\|_{(V_\tau+\Gamma)^{-1}}^2$ with explicit constants and sample-size dependent log-determinant terms, improving upon previous bounds in least-squares and linear bandit analyses. Auxiliary results establish the requisite mgf bounds, KL computations, and volume-based arguments that support the PAC-Bayesian derivations. Overall, the work sharpens concentration guarantees for sequential linear models by delivering a variance-aware Bernstein bound suitable for vector martingales and stopping times, with potential implications for regret analyses in linear bandits and autoregression tasks.
Abstract
We prove a Bernstein inequality for vector-valued self-normalized martingales. We first give an alternative perspective of the corresponding sub-Gaussian bound due to Abbasi-Yadkori et al. via a PAC-Bayesian argument with Gaussian priors. By instantiating this argument to priors drawn uniformly over well-chosen ellipsoids, we obtain a Bernstein bound.
