A Novel Approach to Process Discovery with Enhanced Loop Handling
Ali Nour Eldin, Benjamin Dalmas, Walid Gaaloul
TL;DR
Bonita Miner tackles the challenge of process discovery in logs with complex loop structures by integrating Directly-Follows Graph filtering and a Depth-First Algorithm driven gateway construction, with loops built separately to keep models simple yet behaviorally accurate. The two-phase approach first purges concurrency and loops from the graph, then constructs a BPMN diagram in a single pass, producing smaller, more interpretable models that balance fitness, precision, and generalization. Empirical results on BPIC real logs and synthetic loop scenarios show Bonita Miner achieving high fitness and F-scores while reducing model complexity compared to state-of-the-art baselines, demonstrating robust loop handling and generalization. The work offers practical BPMN models with improved readability and reliability, and points to future work in runtime optimization and deadlock detection to further increase industrial applicability.
Abstract
Automated process discovery from event logs is a key component of process mining, allowing companies to acquire meaningful insights into their business processes. Despite significant research, present methods struggle to balance important quality dimensions: fitness, precision, generalization, and complexity, but is limited when dealing with complex loop structures. This paper introduces Bonita Miner, a novel approach to process model discovery that generates behaviorally accurate Business Process Model and Notation (BPMN) diagrams. Bonita Miner incorporates an advanced filtering mechanism for Directly Follows Graphs (DFGs) alongside innovative algorithms designed to capture concurrency, splits, and loops, effectively addressing limitations of balancing as much as possible these four metrics, either there exists a loop, which challenge in existing works. Our approach produces models that are simpler and more reflective of the behavior of real-world processes, including complex loop dynamics. Empirical evaluations using real-world event logs demonstrate that Bonita Miner outperforms existing methods in fitness, precision, and generalization, while maintaining low model complexity.
