Tight Practical Bounds for Subgraph Densities in Ego-centric Networks
Connor Mattes, Esha Datta, Ali Pinar
TL;DR
The paper addresses whether local network structure in ego-centric graphs is shaped by social factors or by intrinsic mathematical constraints. It combines plain flag algebra-based bounds with motif counting and topological data analysis to define the subgraph spread ratio, a metric that quantifies how much of the feasible region of localized subgraph densities is realized. Key contributions include substantially tighter feasible regions (up to about threefold improvement over prior bounds), the introduction of the subgraph spread ratio, and empirical validation across 11 real networks showing social networks have smaller spreads than linkage graphs. The work provides a practical, scalable tool for network comparison and offers a framework for interpreting local graph structure in terms of domain-driven versus mathematically-determined factors.
Abstract
Subgraph densities play a crucial role in network analysis, especially for the identification and interpretation of meaningful substructures in complex graphs. Localized subgraph densities, in particular, can provide valuable insights into graph structures. Distinguishing between mathematically-determined and domain-driven subgraph density features, however, poses challenges. For instance, the lack or presence of certain structures can be explained by graph density or degree distribution. These differences are especially meaningful in applied contexts as they allow us to identify instances where the data induces specific network structures, such as friendships in social networks. The goal of this paper is to measure these differences across various types of graphs, conducting social media analysis from a network perspective. To this end, we first provide tighter bounds on subgraph densities. We then introduce the subgraph spread ratio to quantify the realized subgraph densities of specific networks relative to the feasible bounds. Our novel approach combines techniques from flag algebras, motif-counting, and topological data analysis. Crucially, effective adoption of the state-of-the-art in the plain flag algebra method yields feasible regions up to three times tighter than prior best-known results, thereby enabling more accurate and direct comparisons across graphs. We additionally perform an empirical analysis of 11 real-world networks. We observe that social networks consistently have smaller subgraph spread ratios than other types of networks, such as linkage-mapping networks for Wikipedia pages. This aligns with our intuition about social relationships: such networks have meaningful structure that makes them distinct. The subgraph spread ratio enables the quantification of intuitive understandings of network structures and provides a metric for comparing types of networks.
