Community Detection with the Map Equation and Infomap: Theory and Applications
Jelena Smiljanić, Christopher Blöcker, Anton Holmgren, Daniel Edler, Magnus Neuman, Martin Rosvall
TL;DR
The paper presents the map equation and Infomap as a principled, information-theoretic approach to detect flow-based communities in networks, framing community detection as a compression problem for random walks. It surveys representations from simple graphs to memory, multilayer, temporal, and hypergraph models, and details how the map equation organizes flows via module and index codebooks to minimize description length. It covers algorithmic aspects (two-level and multilevel Infomap), remedies to common challenges (resolution and field-of-view limits, Markov-time scaling, higher-order modeling), and extensions that incorporate node attributes, incomplete data, and higher-order interactions. The work also highlights software tools, visualization platforms, and diverse applications including centrality, similarity, bioregions, and model selection, illustrating practical impact across biology, ecology, and networks science. Overall, it provides a comprehensive, actionable framework for applying flow-based community detection to complex systems with rich representations and robust inference methods.
Abstract
Real-world networks have a complex topology comprising many elements often structured into communities. Revealing these communities helps researchers uncover the organizational and functional structure of the system that the network represents. However, detecting community structures in complex networks requires selecting a community detection method among a multitude of alternatives with different network representations, community interpretations, and underlying mechanisms. This tutorial focuses on a popular community detection method called the map equation and its search algorithm Infomap. The map equation framework for community detection describes communities by analyzing dynamic processes on the network. Thanks to its flexibility, the map equation provides extensions that can incorporate various assumptions about network structure and dynamics. To help decide if the map equation is a suitable community detection method for a given complex system and problem at hand - and which variant to choose - we review the map equation's theoretical framework and guide users in applying the map equation to various research problems.
