Interactive Multi Interest Process Pattern Discovery
Mozhgan Vazifehdoostirani, Laura Genga, Xixi Lu, Rob Verhoeven, Hanneke van Laarhoven, Remco Dijkman
TL;DR
This work tackles the problem of discovering process patterns when analysts require multi-dimensional, outcome-oriented insights rather than a single-frequency view. It introduces the IMPresseD framework, an interactive, multi-interest PPDM that defines customizable interest functions and uses Pareto-front optimization to present non-dominated patterns, with pattern extension guided by a set of extension operators. The approach is demonstrated through a healthcare case study and multiple public logs, showing that Pareto-front–driven patterns achieve comparable or better predictive power while substantially reducing the number of patterns to inspect. The framework provides interpretable dashboards and an iterative, human-in-the-loop workflow, offering practical benefits for outcome-oriented process analysis and potential for automated deployment with reduced pattern explosion.
Abstract
Process pattern discovery methods (PPDMs) aim at identifying patterns of interest to users. Existing PPDMs typically are unsupervised and focus on a single dimension of interest, such as discovering frequent patterns. We present an interactive multi interest driven framework for process pattern discovery aimed at identifying patterns that are optimal according to a multi-dimensional analysis goal. The proposed approach is iterative and interactive, thus taking experts knowledge into account during the discovery process. The paper focuses on a concrete analysis goal, i.e., deriving process patterns that affect the process outcome. We evaluate the approach on real world event logs in both interactive and fully automated settings. The approach extracted meaningful patterns validated by expert knowledge in the interactive setting. Patterns extracted in the automated settings consistently led to prediction performance comparable to or better than patterns derived considering single interest dimensions without requiring user defined thresholds.
