Efficient Network Embedding by Approximate Equitable Partitions
Giuseppe Squillace, Mirco Tribastone, Max Tschaikowski, Andrea Vandin
TL;DR
This work addresses scalable structural network embedding by introducing ε-BE, an approximate equitable-partition approach that relaxes exact BE with a tunable tolerance $\varepsilon$ and computes embeddings via a partition-refinement algorithm. An iterative scheme progressively increases $\varepsilon$ to balance aggregation and discrimination, achieving embeddings in $ℝ^{n×d}$ where $d$ equals the number of BE blocks. Empirical results on visualization, classification, and regression tasks show competitive or superior performance with runtimes that are orders of magnitude faster than state-of-the-art methods, enabling large-scale network embeddings previously impractical. The approach leverages BE's relation to Markov-lumping and O($m\log n$) refinement, with the overall iterative complexity scaling as $O(mnΔ)$, demonstrating strong scalability and practical impact for complex network analysis.
Abstract
Structural network embedding is a crucial step in enabling effective downstream tasks for complex systems that aims to project a network into a lower-dimensional space while preserving similarities among nodes. We introduce a simple and efficient embedding technique based on approximate variants of equitable partitions. The approximation consists in introducing a user-tunable tolerance parameter relaxing the otherwise strict condition for exact equitable partitions that can be hardly found in real-world networks. We exploit a relationship between equitable partitions and equivalence relations for Markov chains and ordinary differential equations to develop a partition refinement algorithm for computing an approximate equitable partition in polynomial time. We compare our method against state-of-the-art embedding techniques on benchmark networks. We report comparable -- when not superior -- performance for visualization, classification, and regression tasks at a cost between one and three orders of magnitude smaller using a prototype implementation, enabling the embedding of large-scale networks which could not be efficiently handled by most of the competing techniques.
