Random Abstract Cell Complexes
Josef Hoppe, Michael T. Schaub
TL;DR
Random Abstract Cell Complexes (RCC) introduce a probabilistic framework to generate 2D abstract cell complexes by iteratively lifting a base Erdős–Rényi graph with boundary-size dependent inclusion probabilities for higher-dimensional cells. The work develops a spanning-tree based approach to tame the cycle space, and two practical approximate algorithms: one for counting cycles of given length and one for sampling 2-cells with a target probability, enabling scalable null-models and graph liftings for higher-order networks. The authors provide theoretical complexity results, analyze limitations, and validate the methods empirically across synthetic and real-world graphs, highlighting applications as null models, experiment baselines, and tools for sensitivity analyses in higher-order learning. Together, the results establish RCC as a versatile foundation for studying orientability, homology, and spectral properties of random CCs and for generating synthetic higher-order data with controllable cycle structure.
Abstract
We define a model for random (abstract) cell complexes (CCs), similiar to the well-known Erdős-Rényi model for graphs and its extensions for simplicial complexes. To build a random cell complex, we first draw from an Erdős-Rényi graph, and consecutively augment the graph with cells for each dimension with a specified probability. As the number of possible cells increases combinatorially -- e.g., 2-cells can be represented as cycles, or permutations -- we derive an approximate sampling algorithm for this model limited to two-dimensional abstract cell complexes. As a basis for this algorithm, we first introduce a spanning-tree-based method that samples simple cycles and allows the efficient approximation of various properties, most notably the probability of occurence of a given cycle. This approximation is of independent interest as it enables the approximation of a wide variety of cycle-related graph statistics using importance sampling. We use this to approximate the number of cycles of a given length on a graph, allowing us to calculate the sampling probability to arrive at a desired expected number of sampled 2-cells. The probability approximation also trivially leads to a sampling algorithm for $2$-cells with a desired sampling probability. We provide some initial analysis into the properties of random CCs drawn from this model. We further showcase practical applications for our random CCs as null models, and in the context of (random) liftings of graphs to cell complexes. The cycle sampling, cycle count estimation, and combined cell sampling algorithms are available in the package `py-raccoon` on the Python Packaging Index.
