Chemically inspired Erdős-Rényi oriented hypergraphs
Angel Garcia-Chung, Marisol Bermúdez-Montaña, Peter F. Stadler, Jürgen Jost, Guillermo Restrepo
TL;DR
This work introduces an Erdős-Rényi-like model for oriented hypergraphs to capture high-order chemical space representations, where hypervertices encode educts and products and hyperedges represent reactions with directionality. By deriving exact counts for the complete oriented hypergraph (e.g., $u_r(n)=\frac12(3^n-2^{n+1}+1)$) and per-vertex participation $u(n)=3^{n-1}-2^{n-1}$, the authors establish a framework linking the random wiring probability $p$ to the expected number of reactions and their sizes. A key result is that, for large $n$, the ratio of expected total reactions to expected per-vertex degree satisfies $\frac{\mathbb{E}[R]}{\mathbb{E}[D]} \to \frac{3}{2}$, providing a robust null model for assessing real chemical spaces; the model also yields p-independent size distributions $P(s)=\frac{u_s}{u_r}$ and various growth regimes $\mathbb{E}[R]\sim p\,u_r$ under parametrizations like $p= n^\alpha/\beta^n$. The framework paves the way for phase-transition analysis in higher-order random structures and offers a quantitative benchmark for comparing actual chemical spaces against random wiring, with potential extensions to directed hypergraphs and stoichiometric constraints.
Abstract
High-order structures have been recognised as suitable models for systems going beyond the binary relationships for which graph models are appropriate. Despite their importance and surge in research on these structures, their random cases have been only recently become subjects of interest. One of these high-order structures is the oriented hypergraph, which relates couples of subsets of an arbitrary number of vertices. Here we develop the Erdős-Rényi model for oriented hypergraphs, which corresponds to the random realisation of oriented hyperedges of the complete oriented hypergraph. A particular feature of random oriented hypergraphs is that the ratio between their expected number of oriented hyperedges and their expected degree or size is 3/2 for large number of vertices. We highlight the suitability of oriented hypergraphs for modelling large collections of chemical reactions and the importance of random oriented hypergraphs to analyse the unfolding of chemistry.
