Efficient PAC Learnability of Dynamical Systems Over Multilayer Networks

Zirou Qiu; Abhijin Adiga; Madhav V. Marathe; S. S. Ravi; Daniel J. Rosenkrantz; Richard E. Stearns; Anil Vullikanti

Efficient PAC Learnability of Dynamical Systems Over Multilayer Networks

Zirou Qiu, Abhijin Adiga, Madhav V. Marathe, S. S. Ravi, Daniel J. Rosenkrantz, Richard E. Stearns, Anil Vullikanti

TL;DR

Efficient PAC Learnability of Dynamical Systems Over Multilayer Networks studies how to learn the dynamics of cascade processes on multilayer networks when only a subset of vertex interaction functions are unknown. It develops an efficient PAC learner with a sample complexity that scales as $m_{\mathcal{H}}(\delta,\epsilon) \le \frac{1}{\epsilon}\, \sigma k \log(\frac{\sigma k}{\delta})$, and extends to the PMAC setting for reduced data needs. The work also delivers a tight characterization of model complexity via the Natarajan dimension, showing exact results for single-layer systems ($\text{Ndim}(\mathcal{H})=\sigma$), bounds for multi-layer systems ($\sigma \le \text{Ndim}(\mathcal{H}) \le k\sigma$), and asymptotic equivalence to $k\sigma$ for almost all graphs. The combination of an efficient learning algorithm, rigorous sample-complexity and complexity bounds, and experimental validation provides a solid theoretical foundation for learning multilayer dynamical systems and guides future research on more complex network structures and noisy data.

Abstract

Networked dynamical systems are widely used as formal models of real-world cascading phenomena, such as the spread of diseases and information. Prior research has addressed the problem of learning the behavior of an unknown dynamical system when the underlying network has a single layer. In this work, we study the learnability of dynamical systems over multilayer networks, which are more realistic and challenging. First, we present an efficient PAC learning algorithm with provable guarantees to show that the learner only requires a small number of training examples to infer an unknown system. We further provide a tight analysis of the Natarajan dimension which measures the model complexity. Asymptotically, our bound on the Nararajan dimension is tight for almost all multilayer graphs. The techniques and insights from our work provide the theoretical foundations for future investigations of learning problems for multilayer dynamical systems.

Efficient PAC Learnability of Dynamical Systems Over Multilayer Networks

TL;DR

, and extends to the PMAC setting for reduced data needs. The work also delivers a tight characterization of model complexity via the Natarajan dimension, showing exact results for single-layer systems (

), bounds for multi-layer systems (

), and asymptotic equivalence to

for almost all graphs. The combination of an efficient learning algorithm, rigorous sample-complexity and complexity bounds, and experimental validation provides a solid theoretical foundation for learning multilayer dynamical systems and guides future research on more complex network structures and noisy data.

Abstract

Paper Structure (21 sections, 12 theorems, 29 equations, 8 figures, 2 tables)

This paper contains 21 sections, 12 theorems, 29 equations, 8 figures, 2 tables.

Introduction
Related work.
Preliminaries
Multilayer Networked Dynamical Systems
The Learning Setting
PAC Learnability of Multilayer Systems
An Efficient PAC Learner
The Sample Complexity
Extension to the PMAC Model
Tight Analysis of Model Complexity
An Exact Characterization for a Single Layer
Bounds on Ndim for Multilayer Systems
The Asymptotic Behavior of Ndim
Experimental Analysis
Experimental Results
...and 6 more sections

Key Result

Theorem 3.1

The class $\mathcal{H}{}$ is efficiently PAC learnable.

Figures (8)

Figure 1: A 2-layer threshold system with OR master functions. Threshold values of vertices $v_1$ to $v_4$ in layer 1 are $(2, 3, 3, 2)$, and in layer 2 are $(3, 3, 2, 1)$. State-1 vertices are in blue. The configuration $\mathcal{C} = (1, 1, 1, 0)$, and its successor is $\mathcal{C}' = (1, 0, 0, 1)$.
Figure 2: (a): $\ell$ vs $|\mathcal{T}|$ and (b): $\ell$ vs $\sigma$, over different distributions specified by $p$. The underlying network is Multi-Gnp (Table \ref{['tab:networks']}). The stdev for all data points is less than $0.09$.
Figure 3: (a): $\ell$ vs $|\mathcal{T}|$, over different real-world networks (Table \ref{['tab:networks']}), and (b): $\ell$ vs $k$ over different values of $\sigma$, where the underlying network is Gnp. The stdev for all data points is less than $0.08$.
Figure 4: An alternative interpretation of shattering: associated configurations and $2^{|\mathcal{R}|}$ mappings for a set $\mathcal{R}$.
Figure 5: An example of a contested vertex $v$ for a configuration $\mathcal{C}$. In particular, $\mathcal{C}^A$ and $\mathcal{C}^B$ are the two associated configurations of $\mathcal{C}$. The state of $v$ is highlighted in blue.
...and 3 more figures

Theorems & Definitions (27)

Theorem 3.1
Lemma 3.2
Theorem 3.3
Theorem 3.4
Definition 4.1: Landmark Vertices
Definition 4.2: Canonical Set
Definition 4.3: Contested Vertices
Lemma 4.4
Theorem 4.5
Lemma 4.6
...and 17 more

Efficient PAC Learnability of Dynamical Systems Over Multilayer Networks

TL;DR

Abstract

Efficient PAC Learnability of Dynamical Systems Over Multilayer Networks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (27)