A curiously slowly mixing Markov chain

Persi Diaconis; Andrew Lin; Arun Ram

A curiously slowly mixing Markov chain

Persi Diaconis, Andrew Lin, Arun Ram

TL;DR

This paper analyzes the Burnside process on the binary hypercube $C_2^n$, revealing an extreme discrepancy between $\ell^1$ (total-variation) and $\ell^2$ (chi-square) mixing times. It achieves this through an explicit diagonalization of the transition kernel: the nonzero eigenvalues are $\beta_k=\frac{1}{2^{4k}}\binom{2k}{k}^2$ with multiplicities $\binom{n}{2k}$, and corresponding eigenvectors are constructed via lifted discrete Chebyshev polynomials. The authors prove that, for most starting states, the $\ell^2$ mixing time scales as $\Theta\left(\frac{n}{\log n}\right)$, whereas starting from the all-zeros state yields rapid $\ell^2$ convergence; in contrast, $\ell^1$ mixing is bounded by a constant for all starting states. They also demonstrate the lumping structure that reduces the analysis to orbits and discuss extensions to $C_k^n$, highlighting both similarities and substantial challenges beyond the binary case. The work provides sharp qualitative and quantitative insights into how spectral structure and symmetry govern mixing across norms, with implications for related Markov chains and sampling algorithms.

Abstract

We study a Markov chain with very different mixing rates depending on how mixing is measured. The chain is the "Burnside process on the hypercube $C_2^n$." Started at the all-zeros state, it mixes in a bounded number of steps, no matter how large $n$ is, in $\ell^1$ and in $\ell^2$. And started at general $x$, it mixes in at most $\log n$ steps in $\ell^1$. But, in $\ell^2$, it takes $\frac{n}{\log n}$ steps for most starting $x$. The $\ell^2$ mixing results follow from an explicit diagonalization of the Markov chain into binomial-coefficient-valued eigenvectors.

A curiously slowly mixing Markov chain

TL;DR

This paper analyzes the Burnside process on the binary hypercube

, revealing an extreme discrepancy between

(total-variation) and

(chi-square) mixing times. It achieves this through an explicit diagonalization of the transition kernel: the nonzero eigenvalues are

with multiplicities

, and corresponding eigenvectors are constructed via lifted discrete Chebyshev polynomials. The authors prove that, for most starting states, the

mixing time scales as

, whereas starting from the all-zeros state yields rapid

convergence; in contrast,

mixing is bounded by a constant for all starting states. They also demonstrate the lumping structure that reduces the analysis to orbits and discuss extensions to

, highlighting both similarities and substantial challenges beyond the binary case. The work provides sharp qualitative and quantitative insights into how spectral structure and symmetry govern mixing across norms, with implications for related Markov chains and sampling algorithms.

Abstract

We study a Markov chain with very different mixing rates depending on how mixing is measured. The chain is the "Burnside process on the hypercube

." Started at the all-zeros state, it mixes in a bounded number of steps, no matter how large

is, in

and in

. And started at general

, it mixes in at most

steps in

. But, in

, it takes

steps for most starting

. The

mixing results follow from an explicit diagonalization of the Markov chain into binomial-coefficient-valued eigenvectors.

A curiously slowly mixing Markov chain

TL;DR

Abstract

A curiously slowly mixing Markov chain

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (38)