Bootstrapped Control Limits for Score-Based Concept Drift Control Charts

Jiezhong Wu; Daniel W. Apley

Bootstrapped Control Limits for Score-Based Concept Drift Control Charts

Jiezhong Wu, Daniel W. Apley

TL;DR

This work tackles concept drift detection in predictive models by focusing on the mean of Fisher score vectors and monitoring with a score-based MEWMA statistic. It introduces a novel nested bootstrap procedure to calibrate time-varying control limits using the full training data, accompanied by a 0.632-like variance correction to correctly reflect finite-sample variability. The approach yields accurate, pointwise false-alarm control and is data-efficient, enabling application with complex models, including deep neural networks. The results on linear and nonlinear dynamical systems demonstrate robust drift detection while maintaining proper Type I error rates, highlighting practical utility for real-time monitoring of predictive relationships.

Abstract

Monitoring for changes in a predictive relationship represented by a fitted supervised learning model (i.e., concept drift detection) is a widespread problem in modern data-driven applications. A general and powerful Fisher score-based concept drift approach was recently proposed, in which detecting concept drift reduces to detecting changes in the mean of the model's score vector using a multivariate exponentially weighted moving average (MEWMA). To implement the approach, the initial data must be split into two subsets. The first subset serves as the training sample to which the model is fit, and the second subset serves as an out-of-sample test set from which the MEWMA control limit (CL) is determined. In this paper, we retain the same score-based MEWMA monitoring statistic as the existing method and focus instead on improving the computation of the control limit. We develop a novel nested bootstrap procedure for calibrating the CL that allows the entire initial sample to be used for model fitting, thereby yielding a more accurate baseline model while eliminating the need for a large holdout set. We show that a standard nested bootstrap substantially underestimates the variability of the monitoring statistic and develop a 0.632-like correction that appropriately accounts for this. We demonstrate the advantages with numerical examples.

Bootstrapped Control Limits for Score-Based Concept Drift Control Charts

TL;DR

Abstract

Bootstrapped Control Limits for Score-Based Concept Drift Control Charts

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (16)