Table of Contents
Fetching ...

Efficient Maximal Frequent Group Enumeration in Temporal Bipartite Graphs

Yanping Wu, Renjie Sun, Xiaoyang Wang, Dong Wen, Ying Zhang, Lu Qin, Xuemin Lin

TL;DR

This work addresses maximal $\lambda$-frequency group (MFG) enumeration in temporal bipartite graphs, where unilateral vertex sets on one layer must form a $(\tau_U,\tau_V)$-biclique in at least $\lambda$ snapshots. It introduces a filter-and-verification (FilterV) approach that extends Bron–Kerbosch with $(\tau_V,\tau_U,\lambda)$-core pruning, efficient frequency checks via array-based counting, and a maximality check leveraging a processed-vertex set. To scale further, a verification-free (VFree) method uses a timestamp-oriented dynamic counting framework, achieving a theoretical $\mathcal{O}(|V|)$ speedup in valid-candidate computation and avoiding explicit maximality verification. Extensive experiments on 15 real-world graphs demonstrate that VFree achieves substantial speedups (up to three orders of magnitude) and provides effective, actionable MFG patterns for tasks such as multimorbidity detection and fraud analysis. The results confirm the practical viability and usefulness of modeling unilateral temporal cohesiveness with MFGs in diverse domains.

Abstract

Cohesive subgraph mining is a fundamental problem in bipartite graph analysis. In reality, relationships between two types of entities often occur at some specific timestamps, which can be modeled as a temporal bipartite graph. However, the temporal information is widely neglected by previous studies. Moreover, directly extending the existing models may fail to find some critical groups in temporal bipartite graphs, which appear in a unilateral (i.e., one-layer) form. To fill the gap, in this paper, we propose a novel model, called maximal λ-frequency group (MFG). Given a temporal bipartite graph G=(U,V,E), a vertex set V_S \subseteq V is an MFG if i) there are no less than λtimestamps, at each of which V_S can form a (t_U,t_V)-biclique with some vertices in U at the corresponding snapshot, and ii) it is maximal. To solve the problem, a filter-and-verification (FilterV) method is proposed based on the Bron-Kerbosch framework, incorporating novel filtering techniques to reduce the search space and array-based strategy to accelerate the frequency and maximality verification. Nevertheless, the cost of frequency verification in each valid candidate set computation and maximality check could limit the scalability of FilterV to larger graphs. Therefore, we further develop a novel verification-free (VFree) approach by leveraging the advanced dynamic counting structure proposed. Theoretically, we prove that VFree can reduce the cost of each valid candidate set computation in FilterV by a factor of O(|V|). Furthermore, VFree can avoid the explicit maximality verification because of the developed search paradigm. Finally, comprehensive experiments on 15 real-world graphs are conducted to demonstrate the efficiency and effectiveness of the proposed techniques and model.

Efficient Maximal Frequent Group Enumeration in Temporal Bipartite Graphs

TL;DR

This work addresses maximal -frequency group (MFG) enumeration in temporal bipartite graphs, where unilateral vertex sets on one layer must form a -biclique in at least snapshots. It introduces a filter-and-verification (FilterV) approach that extends Bron–Kerbosch with -core pruning, efficient frequency checks via array-based counting, and a maximality check leveraging a processed-vertex set. To scale further, a verification-free (VFree) method uses a timestamp-oriented dynamic counting framework, achieving a theoretical speedup in valid-candidate computation and avoiding explicit maximality verification. Extensive experiments on 15 real-world graphs demonstrate that VFree achieves substantial speedups (up to three orders of magnitude) and provides effective, actionable MFG patterns for tasks such as multimorbidity detection and fraud analysis. The results confirm the practical viability and usefulness of modeling unilateral temporal cohesiveness with MFGs in diverse domains.

Abstract

Cohesive subgraph mining is a fundamental problem in bipartite graph analysis. In reality, relationships between two types of entities often occur at some specific timestamps, which can be modeled as a temporal bipartite graph. However, the temporal information is widely neglected by previous studies. Moreover, directly extending the existing models may fail to find some critical groups in temporal bipartite graphs, which appear in a unilateral (i.e., one-layer) form. To fill the gap, in this paper, we propose a novel model, called maximal λ-frequency group (MFG). Given a temporal bipartite graph G=(U,V,E), a vertex set V_S \subseteq V is an MFG if i) there are no less than λtimestamps, at each of which V_S can form a (t_U,t_V)-biclique with some vertices in U at the corresponding snapshot, and ii) it is maximal. To solve the problem, a filter-and-verification (FilterV) method is proposed based on the Bron-Kerbosch framework, incorporating novel filtering techniques to reduce the search space and array-based strategy to accelerate the frequency and maximality verification. Nevertheless, the cost of frequency verification in each valid candidate set computation and maximality check could limit the scalability of FilterV to larger graphs. Therefore, we further develop a novel verification-free (VFree) approach by leveraging the advanced dynamic counting structure proposed. Theoretically, we prove that VFree can reduce the cost of each valid candidate set computation in FilterV by a factor of O(|V|). Furthermore, VFree can avoid the explicit maximality verification because of the developed search paradigm. Finally, comprehensive experiments on 15 real-world graphs are conducted to demonstrate the efficiency and effectiveness of the proposed techniques and model.
Paper Structure (17 sections, 10 theorems, 18 figures, 3 tables, 4 algorithms)

This paper contains 17 sections, 10 theorems, 18 figures, 3 tables, 4 algorithms.

Key Result

Lemma 2.1

Given a temporal bipartite graph $\mathcal{G}=(U,V,\mathcal{E})$ and three positive integers $\tau_U$, $\tau_V$ and $\lambda$, any MFG$V_S \subseteq V$ must be contained in a maximal $(\tau_U,\tau_V)$-biclique of the static bipartite graph $G$ of $\mathcal{G}$.

Figures (18)

  • Figure 1: Customer-product temporal bipartite graph (dotted lines denote the edges at $t=5$ for presentation simplicity)
  • Figure 2: A temporal bipartite graph $\mathcal{G}$ with six timestamps ($G_1$-$G_6$ are the corresponding snapshots and solid lines denote the edges in each snapshot)
  • Figure 3: Filter-and-Verification (FilterV)
  • Figure 4: GFCore$(\mathcal{G},\tau_U,\tau_V,\lambda)$
  • Figure 5: CheckFRE$(U_S,V_S,\lambda)$
  • ...and 13 more figures

Theorems & Definitions (26)

  • Example 1.1
  • Definition 2.1: Structural neighbor (s-neighbor)
  • Definition 2.2: Momentary neighbor (m-neighbor)
  • Example 2.1
  • Definition 2.3: $(\tau_U, \tau_V)$-biclique
  • Definition 2.4: Support timestamp
  • Definition 2.5: $\lambda$-frequency group
  • Definition 2.6: Maximal $\lambda$-frequency group (MFG)
  • Example 2.2
  • Lemma 2.1: Structural property
  • ...and 16 more