An Adaptive Sampling Framework for Detecting Localized Concept Drift under Label Scarcity
Junghee Pyeon, Davide Cacciarelli, Kamran Paynabar
TL;DR
PASS addresses the challenge of detecting localized concept drift in regression under label scarcity by integrating residual-informed exploitation, accept–reject exploration, and EWMA-based monitoring into a unified online framework. The method allocates labeling budgets via an $\epsilon$-greedy strategy, targets high-residual regions with a residual-weighted inverse transform sampling, and maintains broad coverage through time-weighted exploration, all while monitoring drift with two one-sided EWMA charts on top-$r$ residuals and residual dispersion. Theoretical properties guarantee that exploitation has a positive chance to hit drift regions and exploration cannot neglect any region, and simulations across Branin, Ishigami, Friedman, and Linkletter demonstrate robust detection under abrupt and incremental drifts, outperforming random and score-based baselines. A UK electricity market case study shows PASS can match full-sampling performance with a fraction of labels, highlighting practical impact for real-time monitoring under labeling constraints.
Abstract
Concept drift and label scarcity are two critical challenges limiting the robustness of predictive models in dynamic industrial environments. Existing drift detection methods often assume global shifts and rely on dense supervision, making them ill-suited for regression tasks with local drifts and limited labels. This paper proposes an adaptive sampling framework that combines residual-based exploration and exploitation with EWMA monitoring to efficiently detect local concept drift under labeling budget constraints. Empirical results on synthetic benchmarks and a case study on electricity market demonstrate superior performance in label efficiency and drift detection accuracy.
