A Riemannian Optimization Approach for Finding the Nearest Reversible Markov Chain
Fabio Durastante, Miryam Gnazzo, Beatrice Meini
TL;DR
This work targets the problem of approximating a given Markov chain by a reversible one with the same stationary distribution, minimizing the Frobenius distance between transition matrices. It introduces a Riemannian-optimization approach on a modified multinomial manifold \mathcal{M}_{\boldsymbol{\pi}} endowed with the Fisher information metric, transforming the problem into a symmetric, positive-definite form and solving it with second-order methods. The method naturally handles transient states and decomposes reducible chains into ergodic classes, enabling blockwise optimization and substantial speedups. Compared with quadratic programming, the Riemannian approach achieves close-to-machine-precision reversibility with lower memory and substantially faster runtimes, offering a practical tool for constructing reversible operators in MCMC, molecular dynamics, and data-driven transfer operators. The paper also discusses limitations for sparse matrices and points to future work on lifting techniques and KL-divergence formulations to broaden applicability.
Abstract
We address the algorithmic problem of determining the reversible Markov chain $\tilde X$ that is closest to a given Markov chain $X$, with an identical stationary distribution. More specifically, $\tilde X$ is the reversible Markov chain with the closest transition matrix, in the Frobenius norm, to the transition matrix of $X$. To compute the transition matrix of $\tilde X$, we propose a novel approach based on Riemannian optimization. Our method introduces a modified multinomial manifold endowed with a prescribed stationary vector, while also satisfying the detailed balance conditions, all within the framework of the Fisher metric. We evaluate the performance of the proposed approach in comparison with an existing quadratic programming method and demonstrate its effectiveness through a series of synthetic experiments, as well as in the construction of a reversible Markov chain from transition count data obtained via direct estimation from a stochastic differential equation.
