The Window Dilemma: Why Concept Drift Detection is Ill-Posed
Brandon Gower-Winter, Misja Groen, Georg Krempl
TL;DR
This paper argues that concept drift detection is fundamentally ill-posed and heavily dependent on how data are windowed, a phenomenon they term the Window Dilemma. Through both theoretical framing and experiments across real-world streams, the authors show that drift-aware detectors often do not outperform drift-unaware or batch retraining strategies, calling into question the practical value of binary drift detection. The work advocates shifting focus from detecting when drift occurs to understanding where and how drift manifests, and integrating drift insights with broader learning strategies such as transfer learning and change mining. Overall, the findings suggest that choosing a robust classifier and using periodic retraining can outperform many drift-detection approaches, prompting a reevaluation of current drift-detection paradigms in data streams.
Abstract
Non-stationarity of an underlying data generating process that leads to distributional changes over time is a key characteristic of Data Streams. This phenomenon, commonly referred to as Concept Drift, has been intensively studied, and Concept Drift Detectors have been established as a class of methods for detecting such changes (drifts). For the most part, Drift Detectors compare regions (windows) of the data stream and detect drift if those windows are sufficiently dissimilar. In this work, we introduce the Window Dilemma, an observation that perceived drift is a product of windowing and not necessarily the underlying data generating process. Additionally, we highlight that drift detection is ill-posed, primarily because verification of drift events are implausible in practice. We demonstrate these contributions first by an illustrative example, followed by empirical comparisons of drift detectors against a variety of alternative adaptation strategies. Our main finding is that traditional batch learning techniques often perform better than their drift-aware counterparts further bringing into question the purpose of detectors in Stream Classification.
