Table of Contents
Fetching ...

Efficient Hypergraph Pattern Matching via Match-and-Filter and Intersection Constraint

Siwoo Song, Wonseok Shin, Kunsoo Park, Giuseppe F. Italiano, Zhengyi Yang, Wenjie Zhang

TL;DR

<3-5 sentence high-level summary> The paper addresses the challenging problem of hypergraph pattern matching, which is NP-hard, by introducing MaCH, a framework that combines a novel intersection constraint, a candidate hyperedge space (CHS), and a Match-and-Filter approach to prune the search space during backtracking. The key ideas are the formalization of three constraints (Hyperedge Signature, Connectivity, and Intersection), and the demonstration that the Intersection Constraint provides a necessary-and-sufficient condition for valid embeddings via efficient cell-based verification. Empirically, MaCH significantly outperforms state-of-the-art methods (HGMatch, OHMiner, GuP) on real and large-scale hypergraphs, achieving up to orders-of-magnitude speedups and lower memory usage. The work suggests broad applicability of the intersection constraint and the CHS/Match-and-Filter paradigm beyond hypergraph pattern matching.

Abstract

A hypergraph is a generalization of a graph, in which a hyperedge can connect multiple vertices, modeling complex relationships involving multiple vertices simultaneously. Hypergraph pattern matching, which is to find all isomorphic embeddings of a query hypergraph in a data hypergraph, is one of the fundamental problems. In this paper, we present a novel algorithm for hypergraph pattern matching by introducing (1) the intersection constraint, a necessary and sufficient condition for valid embeddings, which significantly speeds up the verification process, (2) the candidate hyperedge space, a data structure that stores potential mappings between hyperedges in the query hypergraph and the data hypergraph, and (3) the Match-and-Filter framework, which interleaves matching and filtering operations to maintain only compatible candidates in the candidate hyperedge space during backtracking. Experimental results on real-world datasets demonstrate that our algorithm significantly outperforms the state-of-the-art algorithms, by up to orders of magnitude in terms of query processing time.

Efficient Hypergraph Pattern Matching via Match-and-Filter and Intersection Constraint

TL;DR

<3-5 sentence high-level summary> The paper addresses the challenging problem of hypergraph pattern matching, which is NP-hard, by introducing MaCH, a framework that combines a novel intersection constraint, a candidate hyperedge space (CHS), and a Match-and-Filter approach to prune the search space during backtracking. The key ideas are the formalization of three constraints (Hyperedge Signature, Connectivity, and Intersection), and the demonstration that the Intersection Constraint provides a necessary-and-sufficient condition for valid embeddings via efficient cell-based verification. Empirically, MaCH significantly outperforms state-of-the-art methods (HGMatch, OHMiner, GuP) on real and large-scale hypergraphs, achieving up to orders-of-magnitude speedups and lower memory usage. The work suggests broad applicability of the intersection constraint and the CHS/Match-and-Filter paradigm beyond hypergraph pattern matching.

Abstract

A hypergraph is a generalization of a graph, in which a hyperedge can connect multiple vertices, modeling complex relationships involving multiple vertices simultaneously. Hypergraph pattern matching, which is to find all isomorphic embeddings of a query hypergraph in a data hypergraph, is one of the fundamental problems. In this paper, we present a novel algorithm for hypergraph pattern matching by introducing (1) the intersection constraint, a necessary and sufficient condition for valid embeddings, which significantly speeds up the verification process, (2) the candidate hyperedge space, a data structure that stores potential mappings between hyperedges in the query hypergraph and the data hypergraph, and (3) the Match-and-Filter framework, which interleaves matching and filtering operations to maintain only compatible candidates in the candidate hyperedge space during backtracking. Experimental results on real-world datasets demonstrate that our algorithm significantly outperforms the state-of-the-art algorithms, by up to orders of magnitude in terms of query processing time.

Paper Structure

This paper contains 41 sections, 7 theorems, 12 figures, 2 tables, 4 algorithms.

Key Result

Lemma 1

Given a query hypergraph $q$ and a data hypergraph $H$, a hyperedge $e \in E_q$ can be mapped to a hyperedge $f \in E_H$ only if $e$ and $f$ have identical signatures.

Figures (12)

  • Figure 1: Example of hypergraph pattern matching where 'A' and 'B' are vertex labels, $u_i$ and $v_i$ are vertex IDs, and $e_i$ and $f_i$ are hyperedge IDs.
  • Figure 2: Hypergraph representation of a transistor-level circuit, where vertices are devices (transistors $M_1$ to $M_4$, resistors $R_1$ and $R_2$, $V_{DD}$, and $GND$) and a hyperedges is a net connecting multiple devices simultaneously.
  • Figure 3: Candidate hyperedge space on $q$ and $H$ in \ref{['fig:hypergraphs']} before and after filtering
  • Figure 4: Match-and-Filter applied to CHS on $q$ and $H$ in \ref{['fig:hypergraphs']}. Current partial embedding $M$ is $\{(e_1, f_1), (e_2, f_3)\}$ (blue). Incompatible candidates (red) are filtered by connectivity constraint in (a) and intersection constraints in (b).
  • Figure 5: Intersection and cells of hyperedges for the partial embedding $\{(e_1, f_2), (e_2, f_1), (e_3,f_3)\}$
  • ...and 7 more figures

Theorems & Definitions (22)

  • Definition 1: Hypergraph
  • Definition 2: Subhypergraph isomorphism
  • Definition 3
  • Lemma 1: Hyperedge Signature Constraint
  • Lemma 2: Connectivity Constraint
  • Example 1
  • Theorem 1: Intersection Constraint
  • Example 2
  • Definition 4
  • Definition 5
  • ...and 12 more