AutoML for Multi-Class Anomaly Compensation of Sensor Drift

Melanie Schaller; Mathis Kruse; Antonio Ortega; Marius Lindauer; Bodo Rosenhahn

AutoML for Multi-Class Anomaly Compensation of Sensor Drift

Melanie Schaller, Mathis Kruse, Antonio Ortega, Marius Lindauer, Bodo Rosenhahn

TL;DR

Sensor drift degrades ML performance over time, and conventional cross-validation can misrepresent this effect due to temporal leakage. The authors propose a two-pronged solution: a novel sensor drift compensation training paradigm and AutoML-DC, which combines anomaly-detection-inspired training with incremental batch learning and automates model/feature/hyperparameter selection using AutoML. They formalize the drift problem across chronological batches and optimize a robust ensemble via CASH (as implemented in auto-sklearn), reporting superior F1 and AUC-ROC on Vergara’s real-world drift dataset and demonstrating strong online-adaptation capabilities. The results show substantial performance gains, underscoring the practical impact for industrial sensor systems, while noting limitations such as dataset generalizability and the need for additional real-world data and unsupervised extensions for broader applicability.

Abstract

Addressing sensor drift is essential in industrial measurement systems, where precise data output is necessary for maintaining accuracy and reliability in monitoring processes, as it progressively degrades the performance of machine learning models over time. Our findings indicate that the standard cross-validation method used in existing model training overestimates performance by inadequately accounting for drift. This is primarily because typical cross-validation techniques allow data instances to appear in both training and testing sets, thereby distorting the accuracy of the predictive evaluation. As a result, these models are unable to precisely predict future drift effects, compromising their ability to generalize and adapt to evolving data conditions. This paper presents two solutions: (1) a novel sensor drift compensation learning paradigm for validating models, and (2) automated machine learning (AutoML) techniques to enhance classification performance and compensate sensor drift. By employing strategies such as data balancing, meta-learning, automated ensemble learning, hyperparameter optimization, feature selection, and boosting, our AutoML-DC (Drift Compensation) model significantly improves classification performance against sensor drift. AutoML-DC further adapts effectively to varying drift severities.

AutoML for Multi-Class Anomaly Compensation of Sensor Drift

TL;DR

Abstract

AutoML for Multi-Class Anomaly Compensation of Sensor Drift

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)