Modeling diffusion in networks with communities: a multitype branching process approach
Alina Dubovskaya, Caroline B. Pena, David J. P. O'Sullivan
TL;DR
Diffusion on networks with community structure is analyzed via a multitype branching process combined with probability-generating functions to obtain distributional properties of cascades. The method handles a simple contagion under the Independent Cascade Model and yields closed-form expressions for extinction probabilities $q(t)$, hazard functions $h(t)$, and cascade-size distributions for each community and the whole network, including cross-community introduction probabilities. It extends from Poisson SBM to general locally tree-like networks using excess-degree distributions, demonstrated on SBM and log-normal networks, and reveals how initial seeding location affects cascade sizes. The results provide a practical tool for predicting outbreak and diffusion behavior from limited degree-distribution information and offer pathways for future work on directed networks, multiple communities, and data-driven validation.
Abstract
The dynamics of diffusion in complex networks are widely studied to understand how entities, such as information, diseases, or behaviors, spread in an interconnected environment. Complex networks often present community structure, and tools to analyze diffusion processes on networks with communities are needed. In this paper, we develop theoretical tools using multi-type branching processes to model and analyze diffusion processes, following a simple contagion mechanism, across a broad class of networks with community structure. We show how, by using limited information about the network -- the degree distribution within and between communities -- we can calculate standard statistical characteristics of propagation dynamics, such as the extinction probability, hazard function, and cascade size distribution. These properties can be estimated not only for the entire network but also for each community separately. Furthermore, we estimate the probability of spread crossing from one community to another where it is not currently spreading. We demonstrate the accuracy of our framework by applying it to two specific examples: the Stochastic Block Model and a log-normal network with community structure. We show how the initial seeding location affects the observed cascade size distribution on a heavy-tailed network and that our framework accurately captures this effect.
