Table of Contents
Fetching ...

A Deep Latent Space Model for Graph Representation Learning

Hanxuan Yang, Qingchao Kong, Wenji Mao

TL;DR

This paper proposes a Deep Latent Space Model (DLSM) for directed graphs to incorporate the traditional latent variable based generative model into deep learning frameworks and achieves the state-of-the-art performances on both link prediction and community detection tasks while learning interpretable node embeddings.

Abstract

Graph representation learning is a fundamental problem for modeling relational data and benefits a number of downstream applications. Traditional Bayesian-based graph models and recent deep learning based GNN either suffer from impracticability or lack interpretability, thus combined models for undirected graphs have been proposed to overcome the weaknesses. As a large portion of real-world graphs are directed graphs (of which undirected graphs are special cases), in this paper, we propose a Deep Latent Space Model (DLSM) for directed graphs to incorporate the traditional latent variable based generative model into deep learning frameworks. Our proposed model consists of a graph convolutional network (GCN) encoder and a stochastic decoder, which are layer-wise connected by a hierarchical variational auto-encoder architecture. By specifically modeling the degree heterogeneity using node random factors, our model possesses better interpretability in both community structure and degree heterogeneity. For fast inference, the stochastic gradient variational Bayes (SGVB) is adopted using a non-iterative recognition model, which is much more scalable than traditional MCMC-based methods. The experiments on real-world datasets show that the proposed model achieves the state-of-the-art performances on both link prediction and community detection tasks while learning interpretable node embeddings. The source code is available at https://github.com/upperr/DLSM.

A Deep Latent Space Model for Graph Representation Learning

TL;DR

This paper proposes a Deep Latent Space Model (DLSM) for directed graphs to incorporate the traditional latent variable based generative model into deep learning frameworks and achieves the state-of-the-art performances on both link prediction and community detection tasks while learning interpretable node embeddings.

Abstract

Graph representation learning is a fundamental problem for modeling relational data and benefits a number of downstream applications. Traditional Bayesian-based graph models and recent deep learning based GNN either suffer from impracticability or lack interpretability, thus combined models for undirected graphs have been proposed to overcome the weaknesses. As a large portion of real-world graphs are directed graphs (of which undirected graphs are special cases), in this paper, we propose a Deep Latent Space Model (DLSM) for directed graphs to incorporate the traditional latent variable based generative model into deep learning frameworks. Our proposed model consists of a graph convolutional network (GCN) encoder and a stochastic decoder, which are layer-wise connected by a hierarchical variational auto-encoder architecture. By specifically modeling the degree heterogeneity using node random factors, our model possesses better interpretability in both community structure and degree heterogeneity. For fast inference, the stochastic gradient variational Bayes (SGVB) is adopted using a non-iterative recognition model, which is much more scalable than traditional MCMC-based methods. The experiments on real-world datasets show that the proposed model achieves the state-of-the-art performances on both link prediction and community detection tasks while learning interpretable node embeddings. The source code is available at https://github.com/upperr/DLSM.

Paper Structure

This paper contains 25 sections, 9 equations, 3 figures, 4 tables.

Figures (3)

  • Figure 1: The architecture of hierarchical variational auto-encoder. The deterministic GNN (blue) encodes the adjacency matrix and attribute (if available) inputs as hidden states ${\hbox{\bf{H}}}^{(l)}$, which are then passed to the LSM decoder to generate the interpretable latent representations $\Theta^{(l)}$. For inference, the posterior variational distributions are also dependent on the GNN likelihood of the corresponding layers (orange arrows).
  • Figure 2: Visualizations of the node embeddings learned on the Email network using a 2D t-SNE projection. Colors denote the ground-truth communities.
  • Figure 3: Probability density distributions of the degrees and node random factors learned by DLSM.