Diffusion Signals Reveal Hidden Connections: A Physics-Inspired Framework for Link Prediction via Personalized PageRank Signals
Huilin Wang Wenjun Zhang Weibing Deng
TL;DR
This work addresses the challenge of accurate, scalable link prediction in complex networks by introducing a physics-inspired framework that blends static topology with dynamical information flow. It defines D-PPR, which uses Personalized PageRank signals as multi-scale structural fingerprints and measures their diffusion on the graph via the Laplacian to produce a diffusion-based similarity score. The key contributions are the formalization of diffusion distance on PPR signals, the reciprocal scoring scheme, and extensive validation on Barabási-Albert, LFR, and seven real networks, where D-PPR often outperforms traditional local and global baselines, especially in sparse or modular graphs. The approach deepens understanding of the topology-dynamics interplay in networks and offers a robust, dynamics-aware tool for link prediction with potential extensions to directed, weighted, and temporal networks and links to graph representation learning.
Abstract
Link prediction in complex networks--identifying the missing or future connections--remains a cornerstone problem for understanding network evolution and function, yet existing methods struggle to balance computational efficiency with theoretical rigor across heterogeneous topologies. This work introduces a physically principled framework, Diffusion Distance with Personalized PageRank (D-PPR), which unifies static topology with dynamic information flow by modeling nodes as signal sources propagating through the network via Personalized PageRank (PPR) vectors. The method quantifies node-pair similarity through the graph Laplacian-governed diffusion distance between their topology-aware signal distributions, thereby bridging microscopic interactions with macroscopic network dynamics. Systematic benchmarking on synthetic (Barabási-Albert, LFR) and seven large-scale real-world networks spanning technology, biology, and social domains demonstrates that D-PPR achieves highly competitive performance, yielding favorable results when compared to representative local and global heuristics, particularly in sparse and modular networks. These findings establish a rigorous foundation for physics-inspired link prediction by revealing that incorporating dynamical processes into structural similarity metrics enables deeper insights into network connectivity patterns, offering both methodological advances and new theoretical perspectives on the interplay between topology and dynamics.
