Guided Exploration of Sequential Rules

Wensheng Gan; Gengsen Huang; Junyu Ren; Philip S. Yu

Guided Exploration of Sequential Rules

Wensheng Gan, Gengsen Huang, Junyu Ren, Philip S. Yu

TL;DR

This paper introduces a novel method for efficiently generating target sequential rules using the proposed techniques and pruning strategies, and proposes the corresponding mining algorithms for two common evaluation metrics: frequency and utility.

Abstract

In pattern mining, sequential rules provide a formal framework to capture the temporal relationships and inferential dependencies between items. However, the discovery process is computationally intensive. To obtain mining results efficiently and flexibly, many methods have been proposed that rely on specific evaluation metrics (i.e., ensuring results meet minimum threshold requirements). A key issue with these methods, however, is that they generate many sequential rules that are irrelevant to users. Such rules not only incur additional computational overhead but also complicate downstream analysis. In this paper, we investigate how to efficiently discover user-centric sequential rules. The original database is first processed to determine whether a target query rule is present. To prune unpromising items and avoid unnecessary expansions, we design tight and generalizable upper bounds. We introduce a novel method for efficiently generating target sequential rules using the proposed techniques and pruning strategies. In addition, we propose the corresponding mining algorithms for two common evaluation metrics: frequency and utility. We also design two rule similarity metrics to help discover the most relevant sequential rules. Extensive experiments demonstrate that our algorithms outperform state-of-the-art approaches in terms of runtime and memory usage, while discovering a concise set of sequential rules under flexible similarity settings. Targeted sequential rule search can handle sequence data with personalized features and achieve pattern discovery. The proposed solution addresses several challenges and can be applied to two common mining tasks.

Guided Exploration of Sequential Rules

TL;DR

Abstract

Paper Structure (14 sections, 8 theorems, 11 equations, 5 figures, 4 tables, 5 algorithms)

This paper contains 14 sections, 8 theorems, 11 equations, 5 figures, 4 tables, 5 algorithms.

Introduction
Related Work
Interesting Pattern Mining
Sequential Rule Mining
Targeted Pattern Mining
Definitions
Algorithm
The Most Useful Sequential Rule Search
Experiments
Experimental Setup and Datasets
Efficiency Analysis
Technique Analysis
Effect of Target Rule Similarity Metric
Conclusion

Key Result

Lemma 1

For a database $\mathcal{D}$ and its modified database $\mathcal{D}^\prime$ and a query rule qr whose length is larger than or equal to 1 * 1, if the sequential rule iX$\rightarrow$iY meets two minimum thresholds in the database $\mathcal{D}^\prime$, then qr itself is a valid target sequential rule

Figures (5)

Figure 1: Runtime under different FPM tasks.
Figure 2: Runtime under different UPM tasks.
Figure 3: Runtime for different algorithm variants.
Figure 4: Runtime for different algorithm variants.
Figure 5: Runtime for different algorithm variants.

Theorems & Definitions (19)

Definition 3.1: Sequence database zaki2001spade
Definition 3.2: Sequential rule fournier2014erminergan2025towards
Definition 3.3: Rule size and rule inclusion gan2025towards
Definition 3.4: Calculation of attribute value
Definition 3.5: Confidence value agrawal1994fasthipp2000algorithms
Definition 3.6: Target sequential rule
Definition 4.1: Rule expansion gan2025towards
Definition 4.2: Rule instance set
Lemma 1
Lemma 2
...and 9 more

Guided Exploration of Sequential Rules

TL;DR

Abstract

Guided Exploration of Sequential Rules

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (19)