Information Geometry of Absorbing Markov-Chain and Discriminative Random Walks
Masanari Kimura
TL;DR
This work addresses the theoretical gap in discriminative random walks (DRWs) for semi-supervised node classification by treating DRWs as a parametric family of hitting-time laws on an absorbing Markov chain and analyzing them via information geometry. It derives closed-form first-passage laws (pmf/pgf), moments, and gradients using a Lyapunov equation for the fundamental matrix, revealing that the observed Fisher information for each seed is rank-one and that a quotient manifold of identifiable edge-weight directions is globally flat with dimension equal to the rank $r$ of the aggregate sensitivity. A Fisher-bounded sensitivity score is defined on this quotient, linking geometric directions to the maximal first-order change in DRW betweenness under unit perturbations and enabling principled strategies for active label acquisition, edge reweighting, and explanations. Collectively, the framework provides a computable, interpretable geometric lens on DRW-based semi-supervised learning with practical implications for label collection and graph design.
Abstract
Discriminative Random Walks (DRWs) are a simple yet powerful tool for semi-supervised node classification, but their theoretical foundations remain fragmentary. We revisit DRWs through the lens of information geometry, treating the family of class-specific hitting-time laws on an absorbing Markov chain as a statistical manifold. Starting from a log-linear edge-weight model, we derive closed-form expressions for the hitting-time probability mass function, its full moment hierarchy, and the observed Fisher information. The Fisher matrix of each seed node turns out to be rank-one, taking the quotient by its null space yields a low-dimensional, globally flat manifold that captures all identifiable directions of the model. Leveraging the geometry, we introduce a sensitivity score for unlabeled nodes that bounds, and in one-dimensional cases attains, the maximal first-order change in DRW betweenness under unit Fisher perturbations. The score can lead to principled strategies for active label acquisition, edge re-weighting, and explanation.
