A method to benchmark high-dimensional process drift detection
Edgar Wolf, Tobias Windisch
TL;DR
This work addresses benchmarking drift detection for high‑dimensional process curves by introducing a controllable data-generation framework based on moving support points in curve space, enabling ground-truth drift segmentation. It defines a temporal performance metric, TAUC, (and a soft variant, sTAUC) that evaluates detectors by integrating false positives against temporal overlap with true drift segments, capturing temporal context that traditional AUC misses. The authors provide a modular detector pipeline and demonstrate a small benchmark showing autoencoder‑based and multivariate tests improve TAUC, while many existing methods falter with multiple drift segments. The driftbench framework and TAUC/SOLS metrics offer a reproducible, scalable basis for developing and comparing drift detectors with practical impact for manufacturing and quality control.
Abstract
Process curves are multivariate finite time series data coming from manufacturing processes. This paper studies machine learning that detect drifts in process curve datasets. A theoretic framework to synthetically generate process curves in a controlled way is introduced in order to benchmark machine learning algorithms for process drift detection. An evaluation score, called the temporal area under the curve, is introduced, which allows to quantify how well machine learning models unveil curves belonging to drift segments. Finally, a benchmark study comparing popular machine learning approaches on synthetic data generated with the introduced framework is presented that shows that existing algorithms often struggle with datasets containing multiple drift segments.
