Optimistic Estimation of Convergence in Markov Chains with the Average-Mixing Time
Geoffrey Wolfer, Pierre Alquier
TL;DR
The paper argues that the conventional worst-case mixing time $t_{\mathsf{mix}}$ is often pessimistic and hard to estimate from data. It advocates the average-mixing time $t^{\sharp}_{\mathsf{mix}}(\xi)$, defined via the stationary $\beta$-mixing coefficients, as an optimistic and estimable proxy for convergence of Markov chains, applicable to finite and countable state spaces. The authors develop single-trajectory estimators for the $\beta$-mixing coefficients and $t^{\sharp}_{\mathsf{mix}}(\xi)$, with explicit finite-sample guarantees under sub-exponential and polynomial mixing, and provide spectral, ergodic, and graph-structure-informed bounds. They also demonstrate implications across state-space scales, including finite spaces, countable spaces, and infinite graphs with controlled growth, and illustrate potential gaps between worst-case and average convergence using a two-point space example. Overall, the work offers a practical framework for data-driven convergence assessment and provides a pathway to more efficient probabilistic guarantees in learning and MCMC diagnostics with weak dependencies.
Abstract
The convergence rate of a Markov chain to its stationary distribution is typically assessed using the concept of total variation mixing time. However, this worst-case measure often yields pessimistic estimates and is challenging to infer from observations. In this paper, we advocate for the use of the average-mixing time as a more optimistic and demonstrably easier-to-estimate alternative. We further illustrate its applicability across a range of settings, from two-point to countable spaces, and discuss some practical implications.
