Table of Contents
Fetching ...

Transfer Learning for Latent Variable Network Models

Akhil Jalan, Arya Mazumdar, Soumendu Sundar Mukherjee, Purnamrita Sarkar

TL;DR

The paper addresses transfer learning for latent-variable network models, where the edge probability matrix $Q$ is learned using a fully observed source $P$ and sparse target data. It introduces a rankings-based algorithm that exploits shared latent structure via a graph-distance to align latent positions without requiring a parametric form, achieving vanishing estimation error. The authors establish a minimax lower bound for Stochastic Block Models and show a simple clustering-based method attains the rate in the appropriate regime, supported by empirical results on simulated and real networks that illustrate effective cross-domain transfer. Overall, the work demonstrates practical transfer capabilities for estimating latent-network structures across domains, with theoretically justified rates and real-data validation.

Abstract

We study transfer learning for estimation in latent variable network models. In our setting, the conditional edge probability matrices given the latent variables are represented by $P$ for the source and $Q$ for the target. We wish to estimate $Q$ given two kinds of data: (1) edge data from a subgraph induced by an $o(1)$ fraction of the nodes of $Q$, and (2) edge data from all of $P$. If the source $P$ has no relation to the target $Q$, the estimation error must be $Ω(1)$. However, we show that if the latent variables are shared, then vanishing error is possible. We give an efficient algorithm that utilizes the ordering of a suitably defined graph distance. Our algorithm achieves $o(1)$ error and does not assume a parametric form on the source or target networks. Next, for the specific case of Stochastic Block Models we prove a minimax lower bound and show that a simple algorithm achieves this rate. Finally, we empirically demonstrate our algorithm's use on real-world and simulated graph transfer problems.

Transfer Learning for Latent Variable Network Models

TL;DR

The paper addresses transfer learning for latent-variable network models, where the edge probability matrix is learned using a fully observed source and sparse target data. It introduces a rankings-based algorithm that exploits shared latent structure via a graph-distance to align latent positions without requiring a parametric form, achieving vanishing estimation error. The authors establish a minimax lower bound for Stochastic Block Models and show a simple clustering-based method attains the rate in the appropriate regime, supported by empirical results on simulated and real networks that illustrate effective cross-domain transfer. Overall, the work demonstrates practical transfer capabilities for estimating latent-network structures across domains, with theoretically justified rates and real-data validation.

Abstract

We study transfer learning for estimation in latent variable network models. In our setting, the conditional edge probability matrices given the latent variables are represented by for the source and for the target. We wish to estimate given two kinds of data: (1) edge data from a subgraph induced by an fraction of the nodes of , and (2) edge data from all of . If the source has no relation to the target , the estimation error must be . However, we show that if the latent variables are shared, then vanishing error is possible. We give an efficient algorithm that utilizes the ordering of a suitably defined graph distance. Our algorithm achieves error and does not assume a parametric form on the source or target networks. Next, for the specific case of Stochastic Block Models we prove a minimax lower bound and show that a simple algorithm achieves this rate. Finally, we empirically demonstrate our algorithm's use on real-world and simulated graph transfer problems.
Paper Structure (19 sections, 34 theorems, 68 equations, 2 figures, 2 tables, 2 algorithms)

This paper contains 19 sections, 34 theorems, 68 equations, 2 figures, 2 tables, 2 algorithms.

Key Result

Theorem 1.1

There exists an efficient algorithm such that, if given source data $A_P \in \{0,1\}^{n \times n}$ and target data $A_Q \in \{0,1\}^{n_Q \times n_Q}$ coming from an appropriate pair $(f_P, f_Q)$ of latent variable models, outputs $\hat{Q} \in \mathbb{R}^{n \times n}$ such that:

Figures (2)

  • Figure 1: Comparison of algorithms on three source-target pairs ($n = 2000, n_Q = 500$). In each heatmap, the lower triangle is the target $Q$. Algorithm \ref{['alg:q-perfect-clustering']} performs best when $(P, Q)$ are SBMs (top), while Algorithm \ref{['alg:q-averaging-row-wise']} is better for smooth graphons (2nd and 3rd row).
  • Figure 2: Results of network estimation on real-world data. Shaded regions denote $[1, 99]$ percentile outcomes from $50$ trials. Left half: Estimating metabolic network of iJN1463 ( Pseudomonas putida) with source iWFL1372 ( Escherichia coli W) leftmost, and source iPC815 ( Yersinia pestis) second-left. Right half: Using source data from days $1-80$ of Email-EU to estimate target days $81-160$ (third-left) and target days $561-640$ (rightmost). Note that we use smaller values of $p_{flip}$ for the Oracle in Email-EU.

Theorems & Definitions (63)

  • Theorem 1.1: Theorem \ref{['thm:alg1error']}, Informal
  • Definition 1.2: Graph Distance
  • Definition 1.3: Rankings Assumption at quantile $h_n$
  • Definition 2.1
  • Theorem 2.3
  • Definition 3.1: SBM
  • Theorem 3.2: Minimax Lower Bound for SBMs
  • Remark 3.3: Clustering Regime
  • Proposition 3.4: Error rate of Algorithm \ref{['alg:q-perfect-clustering']}
  • Lemma A.1: hoeffding1994probability
  • ...and 53 more