The Artificial Benchmark for Community Detection with Outliers and Overlapping Communities (ABCD+$o^2$)
Jordan Barrett, Ryan DeWolfe, Bogumił Kamiński, Paweł Prałat, Aaron Smith, François Théberge
TL;DR
This paper addresses the need for scalable, analytically tractable benchmarks for overlapping community detection with outliers. It introduces ABCD+o^2, a six-phase generative framework that uses a hidden geometric reference layer to produce overlapping communities and a background edge process, all governed by power-law degree and community-size distributions. The authors validate the model against real networks (DBLP, Amazon, YouTube) and show controllable overlap, degree-communities correlation, and density properties, while demonstrating its utility for benchmarking through comparative experiments with multiple algorithms. The work provides a flexible, fast, and interpretable benchmark that supports systematic study of overlap and noise in community detection.
Abstract
The Artificial Benchmark for Community Detection (ABCD) graph is a random graph model with community structure and power-law distribution for both degrees and community sizes. The model generates graphs similar to the well-known LFR model but it is faster, more interpretable, and can be investigated analytically. In this paper, we use the underlying ingredients of the ABCD model, and its generalization to include outliers (ABCD+$o$), and introduce another variant that allows for overlapping communities, ABCD+$o^2$.
