ffstruc2vec: Flat, Flexible and Scalable Learning of Node Representations from Structural Identities

Mario Heidrich; Jeffrey Heidemann; Rüdiger Buchkremer; Gonzalo Wandosell Fernández de Bobadilla

ffstruc2vec: Flat, Flexible and Scalable Learning of Node Representations from Structural Identities

Mario Heidrich, Jeffrey Heidemann, Rüdiger Buchkremer, Gonzalo Wandosell Fernández de Bobadilla

TL;DR

ffstruc2vec addresses the need for scalable node embeddings that preserve structural identities across diverse downstream tasks. It builds a flat similarity graph from multiple graph indicators, learns embeddings via biased random walks and Skip-gram, and then applies task-aware optimization to tailor representations to specific applications. The method delivers greater flexibility, interpretability, and scalability than prior work like struc2vec, with empirical gains on unsupervised and supervised benchmarks across synthetic and real networks. This framework enables explainable reasoning about which structural motifs drive downstream outcomes, making it practical for large-scale graphs in domains such as fraud detection and air-traffic analysis.

Abstract

Node embedding refers to techniques that generate low-dimensional vector representations of nodes in a graph while preserving specific properties of the nodes. A key challenge in the field is developing scalable methods that can preserve structural properties suitable for the required types of structural patterns of a given downstream application task. While most existing methods focus on preserving node proximity, those that do preserve structural properties often lack the flexibility to preserve various types of structural patterns required by downstream application tasks. This paper introduces ffstruc2vec, a scalable deep-learning framework for learning node embedding vectors that preserve structural identities. Its flat, efficient architecture allows high flexibility in capturing diverse types of structural patterns, enabling broad adaptability to various downstream application tasks. The proposed framework significantly outperforms existing approaches across diverse unsupervised and supervised tasks in practical applications. Moreover, ffstruc2vec enables explainability by quantifying how individual structural patterns influence task outcomes, providing actionable interpretation. To our knowledge, no existing framework combines this level of flexibility, scalability, and structural interpretability, underscoring its unique capabilities.

ffstruc2vec: Flat, Flexible and Scalable Learning of Node Representations from Structural Identities

TL;DR

Abstract

ffstruc2vec: Flat, Flexible and Scalable Learning of Node Representations from Structural Identities

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (16)

Theorems & Definitions (8)