Stochastic $p$th root approximation of a stochastic matrix: A Riemannian optimization approach
Fabio Durastante, Beatrice Meini
TL;DR
The paper addresses the problem of approximating the $p$th root of a stochastic matrix by a stochastic matrix, leveraging Riemannian optimization on manifolds of positive stochastic matrices. It introduces two complementary approaches: (i) optimization on the multinomial manifold $ S_n$ to minimize $ frac{1}{2}\|X^p-A ight\|_F^2$, and (ii) optimization on the new manifold $ S_n^{oldsymbol{oldsymbol{ u}}}$ of matrices sharing the stationary distribution with $A$, ensuring the root preserves the stationary vector. The authors derive the necessary tangent spaces, projections, Riemannian gradients, and retractions (notably a Sinkhorn-based retraction), implement the methods in MATLAB/Manopt, and demonstrate that the Riemannian methods typically outperform constrained optimization in speed and accuracy, with the pi-preserving variant offering stationarity guarantees at a controlled cost. They also provide careful treatment of computational challenges, including the linear systems that arise in projections, spectral bounds, and preconditioning strategies, and show applicability to reducible chains through perturbation techniques. Overall, the work advances stochastic matrix embeddings by enabling efficient, structure-preserving approximations with practical impact on Markov-chain modeling and embedding problems.
Abstract
We propose two approaches, based on Riemannian optimization, for computing a stochastic approximation of the $p$th root of a stochastic matrix $A$. In the first approach, the approximation is found in the Riemannian manifold of positive stochastic matrices. In the second approach, we introduce the Riemannian manifold of positive stochastic matrices sharing with $A$ the Perron eigenvector and we compute the approximation of the $p$th root of $A$ in such a manifold. This way, differently from the available methods based on constrained optimization, $A$ and its $p$th root approximation share the Perron eigenvector. Such a property is relevant, from a modelling point of view, in the embedding problem for Markov chains. The extended numerical experimentation shows that, in the first approach, the Riemannian optimization methods are generally faster and more accurate than the available methods based on constrained optimization. In the second approach, even though the stochastic approximation of the $p$th root is found in a smaller set, the approximation is generally more accurate than the one obtained by standard constrained optimization.
