Comparing Labeled Markov Chains: A Cantor-Kantorovich Approach
Adrien Banse, Alessandro Abate, Raphaël M. Jungers
TL;DR
The paper addresses how to compare Labeled Markov Chains (LMCs) by introducing the Cantor-Kantorovich (CK) distance, which arises from the Cantor topology on $A^\omega$ and can be written as a discounted sum of finite-horizon Total Variation distances: $d(\Sigma_1,\Sigma_2) = \sum_{i=1}^\infty \frac{m-1}{m^i} \mathrm{TV}(p_1^i, p_2^i)$. It establishes that exact CK computation is $\#\mathrm{P}$-hard, proves strong continuity properties and finite-horizon guarantees, and proposes a computable approximation scheme that is itself $\#\mathrm{P}$-hard to compute. The CK distance provides a rigorous, trace-based notion that captures transient behavior and aligns with Cantor-topology intuition, offering practical relevance for abstraction and perturbation analysis in probabilistic systems. Open questions remain on whether CK distance is computable in general and whether there exist efficient polynomial-time approximation methods.
Abstract
Labeled Markov Chains (or LMCs for short) are useful mathematical objects to model complex probabilistic languages. A central challenge is to compare two LMCs, for example to assess the accuracy of an abstraction or to quantify the effect of model perturbations. In this work, we study the recently introduced Cantor-Kantorovich (or CK) distance. In particular we show that the latter can be framed as a discounted sum of finite-horizon Total Variation distances, making it an instance of discounted linear distance, but arising from the natural Cantor topology. Building on the latter observation, we analyze the properties of the CK distance along three dimensions: computational complexity, continuity properties and approximation. More precisely, we show that the exact computation of the CK distance is #P-hard. We also provide an upper bound on the CK distance as a function of the approximation relation between the two LMCs, and show that a bounded CK distance implies a bounded error between probabilities of finite-horizon traces. Finally, we provide a computable approximation scheme, and show that the latter is also #P-hard. Altogether, our results provide a rigorous theoretical foundation for the CK distance and clarify its relationship with existing distances.
