Table of Contents
Fetching ...

Guided Exploration of Sequential Rules

Wensheng Gan, Gengsen Huang, Junyu Ren, Philip S. Yu

TL;DR

This paper introduces a novel method for efficiently generating target sequential rules using the proposed techniques and pruning strategies, and proposes the corresponding mining algorithms for two common evaluation metrics: frequency and utility.

Abstract

In pattern mining, sequential rules provide a formal framework to capture the temporal relationships and inferential dependencies between items. However, the discovery process is computationally intensive. To obtain mining results efficiently and flexibly, many methods have been proposed that rely on specific evaluation metrics (i.e., ensuring results meet minimum threshold requirements). A key issue with these methods, however, is that they generate many sequential rules that are irrelevant to users. Such rules not only incur additional computational overhead but also complicate downstream analysis. In this paper, we investigate how to efficiently discover user-centric sequential rules. The original database is first processed to determine whether a target query rule is present. To prune unpromising items and avoid unnecessary expansions, we design tight and generalizable upper bounds. We introduce a novel method for efficiently generating target sequential rules using the proposed techniques and pruning strategies. In addition, we propose the corresponding mining algorithms for two common evaluation metrics: frequency and utility. We also design two rule similarity metrics to help discover the most relevant sequential rules. Extensive experiments demonstrate that our algorithms outperform state-of-the-art approaches in terms of runtime and memory usage, while discovering a concise set of sequential rules under flexible similarity settings. Targeted sequential rule search can handle sequence data with personalized features and achieve pattern discovery. The proposed solution addresses several challenges and can be applied to two common mining tasks.

Guided Exploration of Sequential Rules

TL;DR

This paper introduces a novel method for efficiently generating target sequential rules using the proposed techniques and pruning strategies, and proposes the corresponding mining algorithms for two common evaluation metrics: frequency and utility.

Abstract

In pattern mining, sequential rules provide a formal framework to capture the temporal relationships and inferential dependencies between items. However, the discovery process is computationally intensive. To obtain mining results efficiently and flexibly, many methods have been proposed that rely on specific evaluation metrics (i.e., ensuring results meet minimum threshold requirements). A key issue with these methods, however, is that they generate many sequential rules that are irrelevant to users. Such rules not only incur additional computational overhead but also complicate downstream analysis. In this paper, we investigate how to efficiently discover user-centric sequential rules. The original database is first processed to determine whether a target query rule is present. To prune unpromising items and avoid unnecessary expansions, we design tight and generalizable upper bounds. We introduce a novel method for efficiently generating target sequential rules using the proposed techniques and pruning strategies. In addition, we propose the corresponding mining algorithms for two common evaluation metrics: frequency and utility. We also design two rule similarity metrics to help discover the most relevant sequential rules. Extensive experiments demonstrate that our algorithms outperform state-of-the-art approaches in terms of runtime and memory usage, while discovering a concise set of sequential rules under flexible similarity settings. Targeted sequential rule search can handle sequence data with personalized features and achieve pattern discovery. The proposed solution addresses several challenges and can be applied to two common mining tasks.
Paper Structure (14 sections, 8 theorems, 11 equations, 5 figures, 4 tables, 5 algorithms)

This paper contains 14 sections, 8 theorems, 11 equations, 5 figures, 4 tables, 5 algorithms.

Key Result

Lemma 1

For a database $\mathcal{D}$ and its modified database $\mathcal{D}^\prime$ and a query rule qr whose length is larger than or equal to 1 * 1, if the sequential rule iX$\rightarrow$iY meets two minimum thresholds in the database $\mathcal{D}^\prime$, then qr itself is a valid target sequential rule

Figures (5)

  • Figure 1: Runtime under different FPM tasks.
  • Figure 2: Runtime under different UPM tasks.
  • Figure 3: Runtime for different algorithm variants.
  • Figure 4: Runtime for different algorithm variants.
  • Figure 5: Runtime for different algorithm variants.

Theorems & Definitions (19)

  • Definition 3.1: Sequence database zaki2001spade
  • Definition 3.2: Sequential rule fournier2014erminergan2025towards
  • Definition 3.3: Rule size and rule inclusion gan2025towards
  • Definition 3.4: Calculation of attribute value
  • Definition 3.5: Confidence value agrawal1994fasthipp2000algorithms
  • Definition 3.6: Target sequential rule
  • Definition 4.1: Rule expansion gan2025towards
  • Definition 4.2: Rule instance set
  • Lemma 1
  • Lemma 2
  • ...and 9 more