Table of Contents
Fetching ...

Optimising two-block averaging kernels to speed up Markov chains

Ryan J. Y. Lim, Michael C. H. Choi

TL;DR

Several algorithmic approximations, including majorisation-minimisation and coordinate descent schemes, are proposed as computationally feasible alternatives to exhaustive combinatorial search and reveal that optimal cuts under the two objectives can substantially reduce total variation distance to stationarity.

Abstract

We study the problem of selecting optimal two-block partitions to accelerate the mixing of finite Markov chains under group-averaging transformations. The main objectives considered are the Kullback-Leibler (KL) divergence and the Frobenius distance to stationarity. We establish explicit connections between these objectives and the induced projection chain. In the case of the KL divergence, this reduction yields explicit decay rates in terms of the log-Sobolev constant. For the Frobenius distance, we identify a Cheeger-type functional that characterises optimal cuts. This formulation recasts two-block selection as a structured combinatorial optimisation problem admitting difference-of-submodular decompositions. We further propose several algorithmic approximations, including majorisation-minimisation and coordinate descent schemes, as computationally feasible alternatives to exhaustive combinatorial search. Our numerical experiments reveal that optimal cuts under the two objectives can substantially reduce total variation distance to stationarity and demonstrate the practical effectiveness of the proposed approximation algorithms.

Optimising two-block averaging kernels to speed up Markov chains

TL;DR

Several algorithmic approximations, including majorisation-minimisation and coordinate descent schemes, are proposed as computationally feasible alternatives to exhaustive combinatorial search and reveal that optimal cuts under the two objectives can substantially reduce total variation distance to stationarity.

Abstract

We study the problem of selecting optimal two-block partitions to accelerate the mixing of finite Markov chains under group-averaging transformations. The main objectives considered are the Kullback-Leibler (KL) divergence and the Frobenius distance to stationarity. We establish explicit connections between these objectives and the induced projection chain. In the case of the KL divergence, this reduction yields explicit decay rates in terms of the log-Sobolev constant. For the Frobenius distance, we identify a Cheeger-type functional that characterises optimal cuts. This formulation recasts two-block selection as a structured combinatorial optimisation problem admitting difference-of-submodular decompositions. We further propose several algorithmic approximations, including majorisation-minimisation and coordinate descent schemes, as computationally feasible alternatives to exhaustive combinatorial search. Our numerical experiments reveal that optimal cuts under the two objectives can substantially reduce total variation distance to stationarity and demonstrate the practical effectiveness of the proposed approximation algorithms.
Paper Structure (28 sections, 40 theorems, 188 equations, 5 figures, 2 tables)

This paper contains 28 sections, 40 theorems, 188 equations, 5 figures, 2 tables.

Key Result

Proposition 3.1

For any $P \in \mathcal{S}(\pi)$ and any partition $\mathcal{X} = \bigsqcup_{i=1}^k \mathcal{O}_i$, it holds that for $l \in \mathds{N}$ where $x \in \mathcal{O}_i$, $y \in \mathcal{O}_j.$

Figures (5)

  • Figure 1: Plot of worst-case TV distance for $G_S P G_S$ chosen amongst different criteria
  • Figure 2: Cut visualisation by magnetisation for the Curie--Weiss model with $d=4$.
  • Figure 3: Plot of worst-case TV distance for $G_S P$ chosen amongst different criteria
  • Figure 4: Cut visualisation by magnetisation for the Curie--Weiss model with $d=4$.
  • Figure 5: Comparison of cuts between true Frobenius objective via brute-force search and 1/2-approximate minimiser given in Proposition \ref{['prop:1/2-approx']} and \ref{['prop:1/2-approxGPG']}

Theorems & Definitions (70)

  • Proposition 3.1
  • proof
  • Proposition 3.2
  • proof
  • Proposition 3.3
  • proof
  • Corollary 3.4
  • Proposition 3.5
  • proof
  • Corollary 3.6
  • ...and 60 more