A curiously slowly mixing Markov chain
Persi Diaconis, Andrew Lin, Arun Ram
TL;DR
This paper analyzes the Burnside process on the binary hypercube $C_2^n$, revealing an extreme discrepancy between $\ell^1$ (total-variation) and $\ell^2$ (chi-square) mixing times. It achieves this through an explicit diagonalization of the transition kernel: the nonzero eigenvalues are $\beta_k=\frac{1}{2^{4k}}\binom{2k}{k}^2$ with multiplicities $\binom{n}{2k}$, and corresponding eigenvectors are constructed via lifted discrete Chebyshev polynomials. The authors prove that, for most starting states, the $\ell^2$ mixing time scales as $\Theta\left(\frac{n}{\log n}\right)$, whereas starting from the all-zeros state yields rapid $\ell^2$ convergence; in contrast, $\ell^1$ mixing is bounded by a constant for all starting states. They also demonstrate the lumping structure that reduces the analysis to orbits and discuss extensions to $C_k^n$, highlighting both similarities and substantial challenges beyond the binary case. The work provides sharp qualitative and quantitative insights into how spectral structure and symmetry govern mixing across norms, with implications for related Markov chains and sampling algorithms.
Abstract
We study a Markov chain with very different mixing rates depending on how mixing is measured. The chain is the "Burnside process on the hypercube $C_2^n$." Started at the all-zeros state, it mixes in a bounded number of steps, no matter how large $n$ is, in $\ell^1$ and in $\ell^2$. And started at general $x$, it mixes in at most $\log n$ steps in $\ell^1$. But, in $\ell^2$, it takes $\frac{n}{\log n}$ steps for most starting $x$. The $\ell^2$ mixing results follow from an explicit diagonalization of the Markov chain into binomial-coefficient-valued eigenvectors.
