Table of Contents
Fetching ...

Importance Ranking in Complex Networks via Influence-aware Causal Node Embedding

Jiahui Gao, Kuang Zhou, Yuchen Zhu, Keyu Wu

TL;DR

This work tackles the problem of ranking node importance in complex networks when the target topology is not accessible. It introduces ICAN, an influence-aware causal autoencoder that learns node embeddings causally linked to node influence via an SIR-based score and a Markov-blanket–driven ranking module, trained with a unified objective including a causal reconstruction loss and a causal ranking loss under a DAG acyclicity constraint. Empirically, ICAN achieves state-of-the-art ranking accuracy and superior cross-network transferability when trained on synthetic networks and evaluated on diverse real-world graphs, with ablation studies confirming the contributions of the causal embedding and joint optimization. The approach offers practical impact for privacy-preserving network analysis and engineering tasks, enabling reliable node-ranking transfer without access to target network structure, and suggests avenues for future work in training-network selection and cost-aware interventions.

Abstract

Understanding and quantifying node importance is a fundamental problem in network science and engineering, underpinning a wide range of applications such as influence maximization, social recommendation, and network dismantling. Prior research often relies on centrality measures or advanced graph embedding techniques using structural information, followed by downstream classification or regression tasks to identify critical nodes. However, these methods typically decouple node representation learning from the ranking objective and rely on the topological structure of target networks, leading to feature-task inconsistency and limited generalization across networks. This paper proposes a novel framework that leverages causal representation learning to get robust, invariant node embeddings for cross-network ranking tasks. Firstly, we introduce an influence-aware causal node embedding module within an autoencoder architecture to extract node embeddings that are causally related to node importance. Moreover, we introduce a causal ranking loss and design a unified optimization framework that jointly optimizes the reconstruction and ranking objectives, enabling mutual reinforcement between node representation learning and ranking optimization. This design allows the proposed model to be trained on synthetic networks and to generalize effectively across diverse real-world networks. Extensive experiments on multiple benchmark datasets demonstrate that the proposed model consistently outperforms state-of-the-art baselines in terms of both ranking accuracy and cross-network transferability, offering new insights for network analysis and engineering applications-particularly in scenarios where the target network's structure is inaccessible in advance due to privacy or security constraints.

Importance Ranking in Complex Networks via Influence-aware Causal Node Embedding

TL;DR

This work tackles the problem of ranking node importance in complex networks when the target topology is not accessible. It introduces ICAN, an influence-aware causal autoencoder that learns node embeddings causally linked to node influence via an SIR-based score and a Markov-blanket–driven ranking module, trained with a unified objective including a causal reconstruction loss and a causal ranking loss under a DAG acyclicity constraint. Empirically, ICAN achieves state-of-the-art ranking accuracy and superior cross-network transferability when trained on synthetic networks and evaluated on diverse real-world graphs, with ablation studies confirming the contributions of the causal embedding and joint optimization. The approach offers practical impact for privacy-preserving network analysis and engineering tasks, enabling reliable node-ranking transfer without access to target network structure, and suggests avenues for future work in training-network selection and cost-aware interventions.

Abstract

Understanding and quantifying node importance is a fundamental problem in network science and engineering, underpinning a wide range of applications such as influence maximization, social recommendation, and network dismantling. Prior research often relies on centrality measures or advanced graph embedding techniques using structural information, followed by downstream classification or regression tasks to identify critical nodes. However, these methods typically decouple node representation learning from the ranking objective and rely on the topological structure of target networks, leading to feature-task inconsistency and limited generalization across networks. This paper proposes a novel framework that leverages causal representation learning to get robust, invariant node embeddings for cross-network ranking tasks. Firstly, we introduce an influence-aware causal node embedding module within an autoencoder architecture to extract node embeddings that are causally related to node importance. Moreover, we introduce a causal ranking loss and design a unified optimization framework that jointly optimizes the reconstruction and ranking objectives, enabling mutual reinforcement between node representation learning and ranking optimization. This design allows the proposed model to be trained on synthetic networks and to generalize effectively across diverse real-world networks. Extensive experiments on multiple benchmark datasets demonstrate that the proposed model consistently outperforms state-of-the-art baselines in terms of both ranking accuracy and cross-network transferability, offering new insights for network analysis and engineering applications-particularly in scenarios where the target network's structure is inaccessible in advance due to privacy or security constraints.

Paper Structure

This paper contains 31 sections, 2 theorems, 18 equations, 7 figures, 6 tables, 1 algorithm.

Key Result

Lemma 1

Let $\mathbb{G} = (U, E)$ be a graph with adjacency matrix $\bm{W} \in \mathbb{R}^{(p+1)\times(p+1)}$. Then, the $(i,j)$-th element of $\bm{W} ^k$, denoted by $\bm{W} _{ij}^{(k)}$, represents the number of paths of length $k$ from node $v_i$ to node $v_j$, where $i,j = 1,\ldots,p+1$.

Figures (7)

  • Figure 1: The framework outline of ICAN.
  • Figure 2: Ablation study results.
  • Figure 3: Parameters sensitivity study on all the real-world networks for ICAN.
  • Figure 4: Degree distribution of the detected top $10\%$ important nodes in Jazz network.
  • Figure 5: Degree distribution of the detected top $10\%$ important nodes in USAir network.
  • ...and 2 more figures

Theorems & Definitions (3)

  • Lemma 1
  • Theorem 1
  • proof