Learning of networked spreading models from noisy and incomplete data
Mateusz Wilinski, Andrey Y. Lokhov
TL;DR
Addressing learning of spreading dynamics from incomplete and uncertain data, the paper introduces SLICER+ based on Dynamic Message Passing to jointly infer network structure and Independent Cascade parameters $\alpha_{ij}$. It integrates prior information and uncertainty models via a KL-based objective and a Lagrangian constraint framework, achieving linear per-iteration complexity $O(|E|T|S|)$. Empirical results on synthetic networks and real-world data demonstrate robustness to missing data, noisy timestamps, and partial observability, enabling accurate structure recovery and parameter estimation. This work advances practical diffusion modeling by providing a universal, scalable method that accommodates multiple data imperfections simultaneously.
Abstract
Recent years have seen a lot of progress in algorithms for learning parameters of spreading dynamics from both full and partial data. Some of the remaining challenges include model selection under the scenarios of unknown network structure, noisy data, missing observations in time, as well as an efficient incorporation of prior information to minimize the number of samples required for an accurate learning. Here, we introduce a universal learning method based on scalable dynamic message-passing technique that addresses these challenges often encountered in real data. The algorithm leverages available prior knowledge on the model and on the data, and reconstructs both network structure and parameters of a spreading model. We show that a linear computational complexity of the method with the key model parameters makes the algorithm scalable to large network instances.
