Formal Error Bounds for the State Space Reduction of Markov Chains
Fabian Michel, Markus Siegle
TL;DR
This work develops formal error bounds for reducing Markov chains to smaller state spaces, covering both discrete- and continuous-time settings and a highly general aggregation framework that allows arbitrary disaggregation and aggregated dynamics. It proves tight transient-distribution bounds, connects them to dynamic-exact and exact lumpability concepts, and analyzes stationary-distribution bounds, showing when aggregation preserves or approximates stationarity. The authors introduce and compare practical aggregation algorithms (notably SVD-based methods and almost-exact lumpability) and demonstrate substantial speedups across diverse models while quantifying the trade-offs between accuracy and reduced dimension. The results offer a unified theoretical and algorithmic toolkit for reliable state-space reduction in stochastic models, with clear guidance on selecting aggregations and understanding their impact on transient and stationary behavior.
Abstract
We study the approximation of a Markov chain on a reduced state space, for both discrete- and continuous-time Markov chains. In this context, we extend the existing theory of formal error bounds for the approximated transient distributions. As a special case, we consider aggregated (or lumped) Markov chains, where the state space reduction is achieved by partitioning the state space into macro states. In the discrete-time setting, we bound the stepwise increment of the error, and in the continuous-time setting, we bound the rate at which the error grows. In addition, the same error bounds can also be applied to bound how far an approximated stationary distribution is from stationarity. Subsequently, we compare these error bounds with relevant concepts from the literature, such as exact and ordinary lumpability, as well as deflatability and aggregatability. These concepts define stricter than necessary conditions to identify settings in which the aggregation error is zero. We also consider possible algorithms for finding suitable aggregations for which the formal error bounds are low, and we analyse first experiments with these algorithms on a range of different models.
