Observing network dynamics through sentinel nodes

Neil G. MacLaren; Baruch Barzel; Naoki Masuda

Observing network dynamics through sentinel nodes

Neil G. MacLaren, Baruch Barzel, Naoki Masuda

TL;DR

This work tackles the observability challenge in large, nonlinear, and heterogeneous networks by identifying a small sentinel node set that can accurately track the network’s average equilibrium state. The authors formulate an optimization framework, solve it via combinatorial simulated annealing (with optional quadratic-programming-based weight optimization), and demonstrate that a tiny, topology-aware subset can reproduce the full-network mean across multiple dynamics and network types. Importantly, sentinel sets exhibit transferability across different dynamical rules, enabling observations even when the governing dynamics are unknown, and extend to empirical data such as fMRI brain networks with modest gains. The approach offers scalable, practical observability for complex systems, with implications for monitoring ecosystems, neural networks, and epidemic processes, while acknowledging limitations related to monostable dynamics and the need for richer theoretical grounding.

Abstract

A fundamental premise of statistical physics is that the particles in a physical system are interchangeable, and hence the state of each specific component is representative of the system as a whole. This assumption breaks down for complex networks, in which nodes may be extremely diverse, and no single component can truly represent the state of the entire system. It seems, therefore, that to observe the dynamics of social, biological or technological networks, one must extract the dynamic states of a large number of nodes -- a task that is often practically prohibitive. Theoretical tools are also highly restrictive, given the analytically impenetrable combination of complex heterogeneous networks with nonlinear, often hidden, dynamics. To overcome this challenge, we use machine learning techniques to detect the network's sentinel nodes, a set of network components whose combined states can help approximate the average dynamics of the entire network. The method allows us to assess the equilibrium state of a large complex system by tracking just a small number of carefully selected nodes. We find that the sentinels are mainly determined by the network structure such that they can be extracted even with little knowledge of the system's specific interaction dynamics. Therefore, the network's sentinels offer a natural probe by which to observe the system's dynamic states. Intriguingly, sentinels tend to avoid the highly central nodes such as the hubs.

Observing network dynamics through sentinel nodes

TL;DR

Abstract

Paper Structure (41 sections, 15 equations, 34 figures, 23 tables)

This paper contains 41 sections, 15 equations, 34 figures, 23 tables.

Introduction
Results
Seeking the sentinel nodes
Characteristics of good individual sentinel nodes
Characteristics of good sentinel node sets
Transfer learning: Effectiveness of optimized node sets across different dynamics
Sentinel nodes in empirical time series data
Weighted averaging
Discussion
Methods
Node Selection
Dynamical system models
Networks
Optimization of node weights
Undirected and unweighted networks used in the analysis
...and 26 more sections

Figures (34)

Figure 1: Observing network dynamics via sentinel nodes. Approximations of the average activity, $\overline{x}$, of the coupled double-well dynamics on the dolphin network. (a) Coupled double-well dynamics. (b) Dolphin network with $N=62$ nodes. Larger circles (colored) represent nodes in four different sentinel node sets $S$ with sizes $n=1$ (orange), $n=2$ (yellow), $n=3$ (green), and $n=4$ (blue). (c)--(f) The equilibrium states $x_i^*$ of all nodes in the dolphin network as a function of $D$, obtained from numerical simulations (grey lines). The average state $\overline{x}$ is also shown (black solid lines). The purple solid lines represent the sentinel node approximations $\overline{x}^\prime$, obtained from the four node sets highlighted in (b). For $n = 1$, the approximation substantially deviates from the actual $\overline{x}$ (yielding $\varepsilon = 0.035$; see panel (c)) because, just a single sentinel node is used. This deviation is gradually reduced in (d), (e), and (f) as $n$ is increased, until it is practically eliminated at $n=4$ ($\varepsilon = 0.004$). In (c)--(f), we also show the individual sentinel node states (red, yellow, green and blue lines).
Figure 2: Sentinel node approximation for different networks and dynamics. (a) Approximation errors, $\varepsilon$, for 100 optimized (blue circles), degree-preserving (orange triangles), and completely random (green squares) node sets using the coupled double-well dynamics on 15 empirical and 5 model networks. The networks range in size from $N=62$ (Dolphin network) to $N=81,171$ (Prosper network); the model networks each have approximately $1,000$ nodes. ER: Erdős-Rényi. BA: Barabási-Albert. HK: Holme-Kim. GKK: Goh-Kahng-Kim. LFR: Lancichinetti-Fortunato-Radicchi. (b) Approximation errors for 100 node sets of each type using the SIS dynamics on the same 20 networks.
Figure 3: Characteristics of sentinel nodes. (a) Degree histogram of the dolphin network. (b) Degree histogram extracted from $100$ single sentinel nodes ($n = 1$). (c)--(e) Similar histograms across $100$ sets with $n = 2$, $3$, and $4$ nodes. We find that sentinel node sets tend to spread across the degree sequence. Here, the bar color represents the node rank in $S$ in terms of degree: blue - smallest-degree node, orange - second smallest, red - third, light blue - fourth. (f) We repeated the analysis for a BA network of $N = 10^3$ nodes. For this network, we observe the hub nodes at the tail of the histogram, with degrees ranging from $k = 20$ to $85$ (inset). (g)--(j) Histograms extracted from the sentinel node sets with $n = 1,2,3$, and $6$. All histograms with $n > 2$ comprise a mixture of small and intermediate degree nodes, but lack any representation of the hub nodes (see empty insets). Bar colors are the same as in panels (b)--(e), with two additional ranks: green - fifth smallest node, yellow - sixth smallest node.
Figure 4: Degree distribution of optimized vs. random sets. We collected $100$ optimized sets of $n = 1,\dots,12$ nodes from the dolphin network and extracted their collective degree distributions. (a) The Kullback-Leibler divergence, $D_{\rm KL}$, between the optimized sets and the original network as $n$ varies (blue circles). The results obtained from $100$ completely random node sets are also shown (green squares); the 95% confidence interval for each mean is smaller than the symbols and therefore not shown. For all $n$ values, and especially for the smaller sets, we find that the sentinel $D_{\rm KL}$ is significantly larger than that of the random sets. (b) Similar results are obtained for the BA network. Here the gap between the two set types is even more pronounced, likely due to the presence of hubs in the network and their absence in $S$.
Figure 5: Transfer learnability of the sentinel node approximation. We used the dolphin network and our four dynamic models to construct all potential pairs of training and test dynamics. (a) The approximation error $\varepsilon$ obtained from the test dynamics vs. that of the training dynamics for our optimized node sets (blue), degree-preserving node sets (orange), and completely random node sets (green). For example, in panel (a1), we extract the sentinels using the mutualistic species dynamics ($x$-axis, training) then examine their performance against the coupled double-well dynamics ($y$-axis, test). For each node selection method, we present results from $100$ independent sets by small symbols and the averaged $\varepsilon$ by large symbols. In all but two panels, we find that our optimized node sets perform best, both against the training dynamics and, most crucially, against the test dynamics. (b) Dolphin network. The large blue nodes indicate the members of the optimized node set with $n=4$ nodes that attains the lowest approximation error on the coupled double-well training dynamics. (c) Sentinel node approximation for the coupled double-well dynamics on the dolphin network. The states of the nodes marked in (b) are shown in blue. The states of the remaining nodes are shown in gray. The average state $\overline{x}$ (black solid line) and the sentinel node approximation (purple solid line) are also shown. As expected, the sentinel node approximation follows $\overline{x}$ quite tightly. (d) Sentinel node approximation for the SIS dynamics on the same network, using the node set optimized for the coupled double-well dynamics. Despite being trained on a different dynamics, the four blue nodes still offer a reliable approximation for the SIS dynamics.
...and 29 more figures

Observing network dynamics through sentinel nodes

TL;DR

Abstract

Observing network dynamics through sentinel nodes

Authors

TL;DR

Abstract

Table of Contents

Figures (34)