Table of Contents
Fetching ...

Role extraction by matrix equations and generalized random walks

Dario Fasino

TL;DR

The paper addresses the problem of extracting node roles in directed networks with heterogeneous degree distributions by introducing a direction-aware node similarity $S^*$ as the solution to $S - \frac{\beta^2}{2}(P S P^T + Q S Q^T) = S_1$, where $P = D_{out}^{-1}A$, $Q = D_{in}^{-1}A^T$, and $S_1 = P P^T + Q Q^T$. It develops a globally convergent iteration to compute $S^*$, interprets the entries via generalized random $\Psi$-walks, and analyzes the limiting behavior as $\beta$ varies, including a universal limit near $\beta^2=1$ and a baseline near $\beta=0$. A key theoretical result shows an invariance of $S^*$ under degree corrections in degree-corrected SBM settings: for $A = D_1\Theta B\Theta^T D_2$, $S^* = \Theta X \Theta^T$ with $X$ solving a reduced matrix equation; this yields block-constant structure and rank equal to the number of blocks, implying robust role recovery. Numerical experiments on synthetic SBMs and a real faculty-hiring network illustrate that $S^*$ reliably identifies roles in directed graphs with heterogeneous degrees, outperforming the traditional Browet–Van Dooren style similarity in the presence of degree variation. The work also discusses extensions to weighted networks and practical considerations, including computational cost and potential low-rank approximations for large-scale graphs.

Abstract

The nodes in a network can be grouped into 'roles' based on similar connection patterns. This is usually achieved by defining a pairwise node similarity matrix and then clustering rows and columns of this matrix. This paper presents a new similarity matrix for solving role extraction problems in directed networks, which is defined as the solution of a matrix equation and computes node similarities based on random walks that can proceed along the link direction and in the opposite direction. The resulting node similarity measure performs remarkably in role extraction tasks on directed networks with heterogeneous node degree distributions.

Role extraction by matrix equations and generalized random walks

TL;DR

The paper addresses the problem of extracting node roles in directed networks with heterogeneous degree distributions by introducing a direction-aware node similarity as the solution to , where , , and . It develops a globally convergent iteration to compute , interprets the entries via generalized random -walks, and analyzes the limiting behavior as varies, including a universal limit near and a baseline near . A key theoretical result shows an invariance of under degree corrections in degree-corrected SBM settings: for , with solving a reduced matrix equation; this yields block-constant structure and rank equal to the number of blocks, implying robust role recovery. Numerical experiments on synthetic SBMs and a real faculty-hiring network illustrate that reliably identifies roles in directed graphs with heterogeneous degrees, outperforming the traditional Browet–Van Dooren style similarity in the presence of degree variation. The work also discusses extensions to weighted networks and practical considerations, including computational cost and potential low-rank approximations for large-scale graphs.

Abstract

The nodes in a network can be grouped into 'roles' based on similar connection patterns. This is usually achieved by defining a pairwise node similarity matrix and then clustering rows and columns of this matrix. This paper presents a new similarity matrix for solving role extraction problems in directed networks, which is defined as the solution of a matrix equation and computes node similarities based on random walks that can proceed along the link direction and in the opposite direction. The resulting node similarity measure performs remarkably in role extraction tasks on directed networks with heterogeneous node degree distributions.

Paper Structure

This paper contains 17 sections, 12 theorems, 46 equations, 4 figures, 1 algorithm.

Key Result

Theorem 2.1

Let $A\in\mathbb{R}^{n\times n}$ be a matrix with nonnegative entries. If there exists a positive vector $v\in\mathbb{R}^n$ such that $Av = \lambda v$ for some scalar $\lambda > 0$ then $\rho(A) = \lambda$. In addition, if $A$ is also irreducible then $\rho(A)$ is a simple eigenvalue, and an associa

Figures (4)

  • Figure 1: Top row: A random adjacency matrix drawn from a Stochastic Block Model (left) and the corresponding node similarity matrices $S$ (center) and $S^*$ (right), together with the cluster assignments computed by row clustering. Bottom row: Same as above, but using a degree-corrected SBM.
  • Figure 2: Top row: The average matrix of a SBM (left) and the corresponding similarity matrices $S$ (center) and $S^*$ (right). Darker colors represent smaller entries. Bottom row: Same as above but with a degree-corrected SBM.
  • Figure 3: Left: Adjacency matrix rearranged placing nodes consecutively within each role. The diagonal blocks are in color to highlight position, size and consistency of each role. Center: Scatter plot of node in-/out-degrees. Right: NRC rank vs node index.
  • Figure 4: Number of times each node in the network has been assigned a particular role by the $k$-means algorithm. The roles are represented by the same color scheme as in Figure \ref{['fig:3']}.

Theorems & Definitions (23)

  • Theorem 2.1
  • Definition 3.1
  • Theorem 3.2
  • proof
  • Theorem 3.3
  • Lemma 4.1
  • proof
  • Lemma 4.2
  • proof
  • Theorem 4.3
  • ...and 13 more