Fast algorithm for detecting community structure in networks
M. E. J. Newman
TL;DR
Many networks exhibit community structure, but existing edge-betweenness methods are computationally intensive. The authors propose a modularity-guided greedy agglomerative algorithm that merges communities to maximize Q, computing ΔQ efficiently and representing progress as a dendrogram. The method runs in O((m+n)n) time and scales to networks with millions of vertices, delivering results orders of magnitude faster than prior approaches. It yields meaningful community divisions in both synthetic benchmarks and large real-world networks (e.g., a 56,276-node arXiv collaboration network), enabling rapid exploration and visualization of complex network structure.
Abstract
It has been found that many networks display community structure -- groups of vertices within which connections are dense but between which they are sparser -- and highly sensitive computer algorithms have in recent years been developed for detecting such structure. These algorithms however are computationally demanding, which limits their application to small networks. Here we describe a new algorithm which gives excellent results when tested on both computer-generated and real-world networks and is much faster, typically thousands of times faster than previous algorithms. We give several example applications, including one to a collaboration network of more than 50000 physicists.
