A distance function for stochastic matrices
Antony R. Lee, Peter Tino, Iain Bruce Styles
TL;DR
This work develops a principled information-geometry-based framework for comparing Markov chains by introducing a Bhattacharyya-angle distance on Markov-sequence traces and a corresponding stochastic-matrix distance. It shows that, for ergodic chains, the sequence-space and matrix-space distances converge to consistent measures governed by stationary distributions, and it provides closed-form results for diagonalisable and primitive matrices. The approach yields tractable, true-metric distances that are computable from the stochastic matrices themselves, enabling robust comparisons across healthcare process models and other applications. The findings open avenues for principled model comparison in settings where initial conditions should be decoupled from the chain structure, with potential integrations into clustering, monitoring, and ML loss functions for categorical models.
Abstract
Motivated by information geometry, a distance function on the space of stochastic matrices is advocated. Starting with sequences of Markov chains the Bhattacharyya angle is advocated as the natural tool for comparing both short and long term Markov chain runs. Bounds on the convergence of the distance and mixing times are derived. Guided by the desire to compare different Markov chain models, especially in the setting of healthcare processes, a new distance function on the space of stochastic matrices is presented. It is a true distance measure which has a closed form and is efficient to implement for numerical evaluation. In the case of ergodic Markov chains, it is shown that considering either the Bhattacharyya angle on Markov sequences or the new stochastic matrix distance leads to the same distance between models.
