Table of Contents
Fetching ...

Targeted Mining Precise-positioning Episode Rules

Jian Zhu, Xiaoye Chen, Wensheng Gan, Zefeng Chen, Philip S. Yu

TL;DR

This work defines targeted mining of precise-positioning episode rules and introduces TaMIPER, a two-phase, tree-based algorithm designed to discover complete, accurate TaPERs that satisfy user-defined targets, supports, and confidences. It leverages four pruning strategies (PBPS, DBPS, NBPS, LBPS) to drastically reduce candidate exploration and improve runtime and memory efficiency. Comprehensive experiments on six real datasets demonstrate TaMIPER's superior performance and scalability compared to baselines, with case studies highlighting actionable, domain-relevant rules (e.g., bike-sharing station pairs). The approach enables precise, user-driven rule discovery in complex event sequences, with potential applications in weather, network security, and e-commerce, and points to future work on more sophisticated targeted mining structures and constraints.

Abstract

The era characterized by an exponential increase in data has led to the widespread adoption of data intelligence as a crucial task. Within the field of data mining, frequent episode mining has emerged as an effective tool for extracting valuable and essential information from event sequences. Various algorithms have been developed to discover frequent episodes and subsequently derive episode rules using the frequency function and anti-monotonicity principles. However, currently, there is a lack of algorithms specifically designed for mining episode rules that encompass user-specified query episodes. To address this challenge and enable the mining of target episode rules, we introduce the definition of targeted precise-positioning episode rules and formulate the problem of targeted mining precise-positioning episode rules. Most importantly, we develop an algorithm called Targeted Mining Precision Episode Rules (TaMIPER) to address the problem and optimize it using four proposed strategies, leading to significant reductions in both time and space resource requirements. As a result, TaMIPER offers high accuracy and efficiency in mining episode rules of user interest and holds promising potential for prediction tasks in various domains, such as weather observation, network intrusion, and e-commerce. Experimental results on six real datasets demonstrate the exceptional performance of TaMIPER.

Targeted Mining Precise-positioning Episode Rules

TL;DR

This work defines targeted mining of precise-positioning episode rules and introduces TaMIPER, a two-phase, tree-based algorithm designed to discover complete, accurate TaPERs that satisfy user-defined targets, supports, and confidences. It leverages four pruning strategies (PBPS, DBPS, NBPS, LBPS) to drastically reduce candidate exploration and improve runtime and memory efficiency. Comprehensive experiments on six real datasets demonstrate TaMIPER's superior performance and scalability compared to baselines, with case studies highlighting actionable, domain-relevant rules (e.g., bike-sharing station pairs). The approach enables precise, user-driven rule discovery in complex event sequences, with potential applications in weather, network security, and e-commerce, and points to future work on more sophisticated targeted mining structures and constraints.

Abstract

The era characterized by an exponential increase in data has led to the widespread adoption of data intelligence as a crucial task. Within the field of data mining, frequent episode mining has emerged as an effective tool for extracting valuable and essential information from event sequences. Various algorithms have been developed to discover frequent episodes and subsequently derive episode rules using the frequency function and anti-monotonicity principles. However, currently, there is a lack of algorithms specifically designed for mining episode rules that encompass user-specified query episodes. To address this challenge and enable the mining of target episode rules, we introduce the definition of targeted precise-positioning episode rules and formulate the problem of targeted mining precise-positioning episode rules. Most importantly, we develop an algorithm called Targeted Mining Precision Episode Rules (TaMIPER) to address the problem and optimize it using four proposed strategies, leading to significant reductions in both time and space resource requirements. As a result, TaMIPER offers high accuracy and efficiency in mining episode rules of user interest and holds promising potential for prediction tasks in various domains, such as weather observation, network intrusion, and e-commerce. Experimental results on six real datasets demonstrate the exceptional performance of TaMIPER.
Paper Structure (22 sections, 7 figures, 7 tables, 3 algorithms)

This paper contains 22 sections, 7 figures, 7 tables, 3 algorithms.

Figures (7)

  • Figure 1: A sequence of events. This sequence denotes the presence of event $D$ at moment 1, the absence of any event at moment 2, the co-occurrence of events $A$ and $D$ at moment 3, the co-occurrence of events $A$ and $B$ at moment 4, and so on for subsequent moments.
  • Figure 2: The TaPER-tree $\mathcal{T}_{<A>}$ when $i$ = 1 established by Algorithm \ref{['alg: MiningFEO']}.
  • Figure 3: The TaPER-tree $\mathcal{T}_{<A>}$ when $i$ = 3 established by Algorithm \ref{['alg: MiningFEO']}.
  • Figure 4: Runtime on each event dataset under different minconf.
  • Figure 5: Memory consumption on each dataset under different minconf.
  • ...and 2 more figures

Theorems & Definitions (11)

  • Definition 3.1: Event and event sequence ao2017mining
  • Definition 3.2: Episode and episode occurrence ao2017mining
  • Definition 3.3: Minimal episode occurrence (MEO) ao2017mining
  • Definition 3.4: Fixed-gap episode and fixed-gap episode occurrence (FEO) ao2017mining
  • Definition 3.5: Support of minimal episode and support of fixed-gap episode ao2017mining
  • Definition 3.6: Super-episode and superset of episode huang2024taspm
  • Definition 3.7: Episode rule ao2017mining
  • Definition 3.8: Target episode and target episode mining
  • Definition 3.9: Targeted precise-positioning episode rule
  • Definition 3.10: Occurrence of TaPER ao2017mining
  • ...and 1 more