Multi-Scale Node Embeddings for Graph Modeling and Generation

Riccardo Milocco; Fabian Jansen; Diego Garlaschelli

TL;DR

The paper tackles the challenge of learning node embeddings that remain faithful across multiple graph resolutions. It introduces the Multi-Scale Model (MSM), a vector-based, scale-invariant framework in which the embedding of a block-node equals the sum of the embeddings of its constituent nodes, enabling consistent graph modeling and generation under arbitrary coarse-graining. Applied to the ING Input-Output Network and the World Trade Web, and compared with single-scale LPCA, MSM is more scale-consistent, replicates clustering and triangle structure accurately, and reconstructs the networks robustly across scales. The approach provides a principled way to study complex networks at varying resolutions and supports faithful generation of coarse-grained networks from fine-grained embeddings.

Abstract

Lying at the interface between Network Science and Machine Learning, node embedding algorithms take a graph as input and encode its structure onto output vectors that represent nodes in an abstract geometric space, enabling various vector-based downstream tasks such as network modelling, data compression, link prediction, and community detection. Two apparently unrelated limitations affect these algorithms. On one hand, it is not clear what the basic operation defining vector spaces, i.e. the vector sum, corresponds to in terms of the original nodes in the network. On the other hand, while the same input network can be represented at multiple levels of resolution by coarse-graining the constituent nodes into arbitrary block-nodes, the relationship between node embeddings obtained at different hierarchical levels is not understood. Here, building on recent results in network renormalization theory, we address these two limitations at once and define a multiscale node embedding method that, upon arbitrary coarse-grainings, ensures statistical consistency of the embedding vector of a block-node with the sum of the embedding vectors of its constituent nodes. We illustrate the power of this approach on two economic networks that can be naturally represented at multiple resolution levels: namely, the international trade between (sets of) countries and the input-output flows among (sets of) industries in the Netherlands. We confirm the statistical consistency between networks retrieved from coarse-grained node vectors and networks retrieved from sums of fine-grained node vectors, a result that cannot be achieved by alternative methods. Several key network properties, including a large number of triangles, are successfully replicated already from embeddings of very low dimensionality, allowing for the generation of faithful replicas of the original networks at arbitrary resolution levels.
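
The additivity property described in the abstract can be illustrated with a small numerical sketch. The abstract does not state the connection probability, so the form p = 1 - exp(-x_i · x_j) used below is an assumption borrowed from the network renormalization literature, as are the coarse-graining rule (two block-nodes are linked if at least one pair of their constituent nodes is linked), the independence of links, the non-negative random embeddings, and the partition into two blocks. The function name `connection_prob` and all variable names are hypothetical. Under these assumptions, the probability computed from summed embeddings should coincide with the probability obtained by coarse-graining the fine-grained links.

```python
import numpy as np


def connection_prob(x_a, x_b):
    """Assumed link probability (not given in the abstract): p = 1 - exp(-x_a . x_b)."""
    return 1.0 - np.exp(-np.dot(x_a, x_b))


rng = np.random.default_rng(0)
n_nodes, dim = 6, 2

# Non-negative embeddings keep every dot product >= 0, so p stays in [0, 1).
X = rng.uniform(0.0, 1.0, size=(n_nodes, dim))

# Hypothetical partition of the six nodes into two block-nodes.
block_I, block_J = [0, 1, 2], [3, 4, 5]

# Block embeddings defined as sums of the constituent node embeddings.
X_I = X[block_I].sum(axis=0)
X_J = X[block_J].sum(axis=0)
p_from_sum = connection_prob(X_I, X_J)

# Assumed coarse-graining rule: the blocks are linked if at least one
# fine-grained pair (i in I, j in J) is linked, with independent links.
p_no_link = 1.0
for i in block_I:
    for j in block_J:
        p_no_link *= 1.0 - connection_prob(X[i], X[j])
p_from_micro = 1.0 - p_no_link

# The two routes agree because the exponents add up:
# sum_{i,j} x_i . x_j = (sum_i x_i) . (sum_j x_j).
print(np.isclose(p_from_sum, p_from_micro))
```

The agreement follows from the bilinearity of the dot product, which is why a product of "no link" probabilities at the fine scale collapses into a single exponential of the summed vectors. A model whose link probability is not of this exponential form (for instance a logistic one) would not, in general, pass the same check.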
