Transfer Learning for Latent Variable Network Models

Akhil Jalan; Arya Mazumdar; Soumendu Sundar Mukherjee; Purnamrita Sarkar

Transfer Learning for Latent Variable Network Models

Akhil Jalan, Arya Mazumdar, Soumendu Sundar Mukherjee, Purnamrita Sarkar

TL;DR

The paper addresses transfer learning for latent-variable network models, where the edge probability matrix $Q$ is learned using a fully observed source $P$ and sparse target data. It introduces a rankings-based algorithm that exploits shared latent structure via a graph-distance to align latent positions without requiring a parametric form, achieving vanishing estimation error. The authors establish a minimax lower bound for Stochastic Block Models and show a simple clustering-based method attains the rate in the appropriate regime, supported by empirical results on simulated and real networks that illustrate effective cross-domain transfer. Overall, the work demonstrates practical transfer capabilities for estimating latent-network structures across domains, with theoretically justified rates and real-data validation.

Abstract

We study transfer learning for estimation in latent variable network models. In our setting, the conditional edge probability matrices given the latent variables are represented by $P$ for the source and $Q$ for the target. We wish to estimate $Q$ given two kinds of data: (1) edge data from a subgraph induced by an $o(1)$ fraction of the nodes of $Q$, and (2) edge data from all of $P$. If the source $P$ has no relation to the target $Q$, the estimation error must be $Ω(1)$. However, we show that if the latent variables are shared, then vanishing error is possible. We give an efficient algorithm that utilizes the ordering of a suitably defined graph distance. Our algorithm achieves $o(1)$ error and does not assume a parametric form on the source or target networks. Next, for the specific case of Stochastic Block Models we prove a minimax lower bound and show that a simple algorithm achieves this rate. Finally, we empirically demonstrate our algorithm's use on real-world and simulated graph transfer problems.

Transfer Learning for Latent Variable Network Models

TL;DR

The paper addresses transfer learning for latent-variable network models, where the edge probability matrix

is learned using a fully observed source

and sparse target data. It introduces a rankings-based algorithm that exploits shared latent structure via a graph-distance to align latent positions without requiring a parametric form, achieving vanishing estimation error. The authors establish a minimax lower bound for Stochastic Block Models and show a simple clustering-based method attains the rate in the appropriate regime, supported by empirical results on simulated and real networks that illustrate effective cross-domain transfer. Overall, the work demonstrates practical transfer capabilities for estimating latent-network structures across domains, with theoretically justified rates and real-data validation.

Abstract

We study transfer learning for estimation in latent variable network models. In our setting, the conditional edge probability matrices given the latent variables are represented by

for the source and

for the target. We wish to estimate

given two kinds of data: (1) edge data from a subgraph induced by an

fraction of the nodes of

, and (2) edge data from all of

. If the source

has no relation to the target

, the estimation error must be

. However, we show that if the latent variables are shared, then vanishing error is possible. We give an efficient algorithm that utilizes the ordering of a suitably defined graph distance. Our algorithm achieves

error and does not assume a parametric form on the source or target networks. Next, for the specific case of Stochastic Block Models we prove a minimax lower bound and show that a simple algorithm achieves this rate. Finally, we empirically demonstrate our algorithm's use on real-world and simulated graph transfer problems.

Paper Structure (19 sections, 34 theorems, 68 equations, 2 figures, 2 tables, 2 algorithms)

This paper contains 19 sections, 34 theorems, 68 equations, 2 figures, 2 tables, 2 algorithms.

Introduction
Other Related work
Estimating Latent Variable Models with Rankings
Minimax Rates for Stochastic Block Models
Experiments
Conclusion
Acknowledgments
Proofs
Preliminaries
Proof of Theorem \ref{['thm:alg1error']}
Latent distance to graph distance
Graph Distance Concentration
Control of smoothing error
Control of second smoothing error
Overall Error
...and 4 more sections

Key Result

Theorem 1.1

There exists an efficient algorithm such that, if given source data $A_P \in \{0,1\}^{n \times n}$ and target data $A_Q \in \{0,1\}^{n_Q \times n_Q}$ coming from an appropriate pair $(f_P, f_Q)$ of latent variable models, outputs $\hat{Q} \in \mathbb{R}^{n \times n}$ such that:

Figures (2)

Figure 1: Comparison of algorithms on three source-target pairs ($n = 2000, n_Q = 500$). In each heatmap, the lower triangle is the target $Q$. Algorithm \ref{['alg:q-perfect-clustering']} performs best when $(P, Q)$ are SBMs (top), while Algorithm \ref{['alg:q-averaging-row-wise']} is better for smooth graphons (2nd and 3rd row).
Figure 2: Results of network estimation on real-world data. Shaded regions denote $[1, 99]$ percentile outcomes from $50$ trials. Left half: Estimating metabolic network of iJN1463 ( Pseudomonas putida) with source iWFL1372 ( Escherichia coli W) leftmost, and source iPC815 ( Yersinia pestis) second-left. Right half: Using source data from days $1-80$ of Email-EU to estimate target days $81-160$ (third-left) and target days $561-640$ (rightmost). Note that we use smaller values of $p_{flip}$ for the Oracle in Email-EU.

Theorems & Definitions (63)

Theorem 1.1: Theorem \ref{['thm:alg1error']}, Informal
Definition 1.2: Graph Distance
Definition 1.3: Rankings Assumption at quantile $h_n$
Definition 2.1
Theorem 2.3
Definition 3.1: SBM
Theorem 3.2: Minimax Lower Bound for SBMs
Remark 3.3: Clustering Regime
Proposition 3.4: Error rate of Algorithm \ref{['alg:q-perfect-clustering']}
Lemma A.1: hoeffding1994probability
...and 53 more

Transfer Learning for Latent Variable Network Models

TL;DR

Abstract

Transfer Learning for Latent Variable Network Models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (63)