Counting Cohesive Subgraphs with Hereditary Properties

Rong-Hua Li; Xiaowei Ye; Fusheng Jin; Yu-Ping Wang; Ye Yuan; Guoren Wang

Counting Cohesive Subgraphs with Hereditary Properties

Rong-Hua Li, Xiaowei Ye, Fusheng Jin, Yu-Ping Wang, Ye Yuan, Guoren Wang

TL;DR

A general framework called HCSPivot is proposed, which can be applied to count all kinds of HCS, and two novel algorithms with several carefully designed pruning techniques to count s-defective cliques and s-plexes, which are two specific types of HCS.

Abstract

Counting small cohesive subgraphs in a graph is a fundamental operation with numerous applications in graph analysis. Previous studies on cohesive subgraph counting are mainly based on the clique model, which aim to count the number of $k$-cliques in a graph with a small $k$. However, the clique model often proves too restrictive for practical use. To address this issue, we investigate a new problem of counting cohesive subgraphs that adhere to the hereditary property. Here the hereditary property means that if a graph $G$ has a property $\mathcal{P}$, then any induced subgraph of $G$ also has a property $\mathcal{P}$. To count these hereditary cohesive subgraphs (\hcss), we propose a new listing-based framework called \hcslist, which employs a backtracking enumeration procedure to count all \hcss. A notable limitation of \hcslist is that it requires enumerating all \hcss, making it intractable for large and dense graphs due to the exponential growth in the number of \hcss with respect to graph size. To overcome this limitation, we propose a novel pivot-based framework called \hcspivot, which can count most \hcss in a combinatorial manner without explicitly listing them. Two additional noteworthy features of \hcspivot is its ability to (1) simultaneously count \hcss of any size and (2) simultaneously count \hcss for each vertex or each edge, while \hcslist is only capable of counting a specific size of \hcs and obtaining a total count of \hcss in a graph. We focus specifically on two \hcs: $s$-defective clique and $s$-plex, with several non-trivial pruning techniques to enhance the efficiency. We conduct extensive experiments on 8 large real-world graphs, and the results demonstrate the high efficiency and effectiveness of our solutions.

Counting Cohesive Subgraphs with Hereditary Properties

TL;DR

Abstract

-cliques in a graph with a small

. However, the clique model often proves too restrictive for practical use. To address this issue, we investigate a new problem of counting cohesive subgraphs that adhere to the hereditary property. Here the hereditary property means that if a graph

has a property

, then any induced subgraph of

also has a property

. To count these hereditary cohesive subgraphs (\hcss), we propose a new listing-based framework called \hcslist, which employs a backtracking enumeration procedure to count all \hcss. A notable limitation of \hcslist is that it requires enumerating all \hcss, making it intractable for large and dense graphs due to the exponential growth in the number of \hcss with respect to graph size. To overcome this limitation, we propose a novel pivot-based framework called \hcspivot, which can count most \hcss in a combinatorial manner without explicitly listing them. Two additional noteworthy features of \hcspivot is its ability to (1) simultaneously count \hcss of any size and (2) simultaneously count \hcss for each vertex or each edge, while \hcslist is only capable of counting a specific size of \hcs and obtaining a total count of \hcss in a graph. We focus specifically on two \hcs:

-defective clique and

-plex, with several non-trivial pruning techniques to enhance the efficiency. We conduct extensive experiments on 8 large real-world graphs, and the results demonstrate the high efficiency and effectiveness of our solutions.

Paper Structure (19 sections, 18 theorems, 6 figures, 7 tables, 3 algorithms)

This paper contains 19 sections, 18 theorems, 6 figures, 7 tables, 3 algorithms.

Introduction
Preliminaries
The Listing-based Solutions
A general listing framework $\mathsf{HCSList}$
Listing-based $s$-dclique counting
Listing-based $s$-plex counting
The pivot-based solutions
Warm up: a new listing strategy
A general pivot-based counting framework
Pivot-based $s$-dclique counting
Pivot-based $s$-plex counting
Discussions
Experiments
Experimental setup
Performance studies
...and 4 more sections

Key Result

lemma 1

A $s$-dclique with size $q$ such that $q-2 \ge s$ has diameter at most $2$.

Figures (6)

Figure 1: Running example.
Figure 2: The counts of different kinds of $\mathsf{HCS}$.
Figure 3: Illustration of the recursion tree of Algorithm \ref{['alg:pivot']} on the graph in Fig. \ref{['sfig:graph_a']}.
Figure 4: Effectiveness of the upper bounds ($\mathsf{WikiV}$).
Figure 5: Memory overheads of different algorithms
...and 1 more figures

Theorems & Definitions (22)

definition 1: Hereditary graph
definition 2: $s$-dclique Defective_bioinformatics_06
definition 3: $s$-plex Plex_seidman1978graph
lemma 1
lemma 2: Plex_seidman1978graph
theorem 1
lemma 3
lemma 4
lemma 5
theorem 2
...and 12 more

Counting Cohesive Subgraphs with Hereditary Properties

TL;DR

Abstract

Counting Cohesive Subgraphs with Hereditary Properties

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (22)