Source Detection in Networks using the Stationary Distribution of a Markov Chain
Yael Sabato, Amos Azaria, Noam Hazon
TL;DR
The paper tackles identifying the diffusion source in networks under the Independent Cascade model using maximum likelihood estimation, noting that exact computation is intractable. It introduces a principled Markov-chain based method that leverages the stationary distribution, via the Markov chain tree theorem, to aggregate over all possible diffusion trees. Two network-to-Markov-chain conversion schemes, Self-Loops and No-Loops, enable exact or near-exact recovery of the source likelihoods from a single stationary distribution, with the No-Loops variant showing favorable performance under sampling. Experiments on 14 random graph families and 9 real networks demonstrate that the proposed approach outperforms existing baselines, offering a scalable ML-based solution for IC diffusion source detection. This method has practical impact for rapid and principled source tracing in social networks and related diffusion settings.
Abstract
Nowadays, the diffusion of information through social networks is a powerful phenomenon. One common way to model diffusions in social networks is the Independent Cascade (IC) model. Given a set of infected nodes according to the IC model, a natural problem is the source detection problem, in which the goal is to identify the unique node that has started the diffusion. Maximum Likelihood Estimation (MLE) is a common approach for tackling the source detection problem, but it is computationally hard. In this work, we propose an efficient method for the source detection problem under the MLE approach, which is based on computing the stationary distribution of a Markov chain. Using simulations, we demonstrate the effectiveness of our method compared to other state-of-the-art methods from the literature, both on random and real-world networks.
