Table of Contents
Fetching ...

A Projected Upper Bound for Mining High Utility Patterns from Interval-Based Event Sequences

S. Mohammad Mirbagheri

TL;DR

This paper introduces a projected upper bound $\mathcal{P}_k$ for mining high utility patterns from interval-based event sequences (e-sequences), aiming to accelerate HUIPMiner through tighter pruning than the existing $\mathrm{LWU}_k$ bound. By defining $\mathcal{P}_k(L)=u_{max}(L)+\mathrm{LWU}_{k-|L|}(L)$ and proving $\mathcal{P}_k(L) \le \mathrm{LWU}_k(L)$, the authors establish a projected downward-closure property that dynamically tightens as pattern length grows. Empirical evaluations on the BlocksmorchenSensor dataset show that using $\mathcal{P}_k$ yields notable improvements in execution time (up to about 2x) and memory usage (a few percent on average) without losing completeness, demonstrating practical gains for high utility interval-based pattern mining. The approach advances efficient mining in domains with interval-based events by enhancing pruning while maintaining exact pattern discovery.

Abstract

High utility pattern mining is an interesting yet challenging problem. The intrinsic computational cost of the problem will impose further challenges if efficiency in addition to the efficacy of a solution is sought. Recently, this problem was studied on interval-based event sequences with a constraint on the length and size of the patterns. However, the proposed solution lacks adequate efficiency. To address this issue, we propose a projected upper bound on the utility of the patterns discovered from sequences of interval-based events. To show its effectiveness, the upper bound is utilized by a pruning strategy employed by the HUIPMiner algorithm. Experimental results show that the new upper bound improves HUIPMiner performance in terms of both execution time and memory usage.

A Projected Upper Bound for Mining High Utility Patterns from Interval-Based Event Sequences

TL;DR

This paper introduces a projected upper bound for mining high utility patterns from interval-based event sequences (e-sequences), aiming to accelerate HUIPMiner through tighter pruning than the existing bound. By defining and proving , the authors establish a projected downward-closure property that dynamically tightens as pattern length grows. Empirical evaluations on the BlocksmorchenSensor dataset show that using yields notable improvements in execution time (up to about 2x) and memory usage (a few percent on average) without losing completeness, demonstrating practical gains for high utility interval-based pattern mining. The approach advances efficient mining in domains with interval-based events by enhancing pruning while maintaining exact pattern discovery.

Abstract

High utility pattern mining is an interesting yet challenging problem. The intrinsic computational cost of the problem will impose further challenges if efficiency in addition to the efficacy of a solution is sought. Recently, this problem was studied on interval-based event sequences with a constraint on the length and size of the patterns. However, the proposed solution lacks adequate efficiency. To address this issue, we propose a projected upper bound on the utility of the patterns discovered from sequences of interval-based events. To show its effectiveness, the upper bound is utilized by a pruning strategy employed by the HUIPMiner algorithm. Experimental results show that the new upper bound improves HUIPMiner performance in terms of both execution time and memory usage.
Paper Structure (12 sections, 5 theorems, 12 equations, 3 figures, 4 tables)

This paper contains 12 sections, 5 theorems, 12 equations, 3 figures, 4 tables.

Key Result

lemma thmcounterlemma

Given a C-sequence $C$, where $|C| \leq k' \leq k$, then

Figures (3)

  • Figure 1: Performance Comparison of the HUIPMiner algorithm under various $\xi$
  • Figure 2: Performance Comparison of the HUIPMiner algorithm under various $K$
  • Figure 3: Number of patterns discovered by the HUIPMiner algorithm when applying the pruning strategy based on either LDC or PDC property

Theorems & Definitions (23)

  • Definition 1
  • Definition 2
  • Definition 3
  • Definition 4
  • Definition 5
  • Definition 6
  • Definition 7
  • Definition 8
  • Definition 9
  • Definition 10
  • ...and 13 more