Table of Contents
Fetching ...

Unsupervised Concept Drift Detection based on Parallel Activations of Neural Network

Joanna Komorniczak, Paweł Ksieniewicz

TL;DR

The paper tackles concept drift in data streams under limited label availability by introducing the Parallel Activations Drift Detector (PADD), an unsupervised method that uses activations from a fixed, randomly initialized neural network to detect distribution shifts. Drift signaling is based on $r$ replications of a two-sample $t$-test across $e$ outputs, with drift declared when the number of significant tests $a$ exceeds $\theta\cdot e\cdot r$, after which the historical activation buffer is cleared. The authors validate PADD on synthetic streams with varying drift dynamics and feature counts, showing competitive performance against both unsupervised and supervised detectors while providing replicable experiments and open-source code. The work contributes a practical, label-free drift detector and outlines future directions toward non-parametric testing and modeling dependencies among NN outputs. Overall, PADD expands the toolkit for robust drift detection in data streams where labeling is scarce or delayed, enabling more reliable unsupervised monitoring in real-time AI systems.

Abstract

Practical applications of artificial intelligence increasingly often have to deal with the streaming properties of real data, which, considering the time factor, are subject to phenomena such as periodicity and more or less chaotic degeneration - resulting directly in the concept drifts. The modern concept drift detectors almost always assume immediate access to labels, which due to their cost, limited availability and possible delay has been shown to be unrealistic. This work proposes an unsupervised Parallel Activations Drift Detector, utilizing the outputs of an untrained neural network, presenting its key design elements, intuitions about processing properties, and a pool of computer experiments demonstrating its competitiveness with state-of-the-art methods.

Unsupervised Concept Drift Detection based on Parallel Activations of Neural Network

TL;DR

The paper tackles concept drift in data streams under limited label availability by introducing the Parallel Activations Drift Detector (PADD), an unsupervised method that uses activations from a fixed, randomly initialized neural network to detect distribution shifts. Drift signaling is based on replications of a two-sample -test across outputs, with drift declared when the number of significant tests exceeds , after which the historical activation buffer is cleared. The authors validate PADD on synthetic streams with varying drift dynamics and feature counts, showing competitive performance against both unsupervised and supervised detectors while providing replicable experiments and open-source code. The work contributes a practical, label-free drift detector and outlines future directions toward non-parametric testing and modeling dependencies among NN outputs. Overall, PADD expands the toolkit for robust drift detection in data streams where labeling is scarce or delayed, enabling more reliable unsupervised monitoring in real-time AI systems.

Abstract

Practical applications of artificial intelligence increasingly often have to deal with the streaming properties of real data, which, considering the time factor, are subject to phenomena such as periodicity and more or less chaotic degeneration - resulting directly in the concept drifts. The modern concept drift detectors almost always assume immediate access to labels, which due to their cost, limited availability and possible delay has been shown to be unrealistic. This work proposes an unsupervised Parallel Activations Drift Detector, utilizing the outputs of an untrained neural network, presenting its key design elements, intuitions about processing properties, and a pool of computer experiments demonstrating its competitiveness with state-of-the-art methods.
Paper Structure (15 sections, 7 figures, 2 tables, 1 algorithm)

This paper contains 15 sections, 7 figures, 2 tables, 1 algorithm.

Figures (7)

  • Figure 1: An example of 2-dimensional data (white points) presented in a context of ultimate layer activations of randomly initialized nn (top charts) and their mean activation (bottom chart) of four examined nn outputs during stream processing. Vibrant red areas in the top charts correspond to the high activation of a model, and vibrant blue to its low activation. The ticks on the horizontal axis of the bottom chart signal the moments of an abrupt concept change.
  • Figure 2: Drift detection error measures for a single 30-dimensional data stream with sudden concept drifts, depending on the values of two critical hyperparameters -- alpha and threshold, describing the method sensitivity. Red cells indicate high errors, while blue cells -- the low error.
  • Figure 3: The combination of drift detection error measures, normalized to a range 0-1 in each of three measures, presented as an RGB image. Black cells indicate the lowest error across all three measures.
  • Figure 4: Detection moments of all evaluated methods for streams with sudden concept drifts. The actual drifts are marked with ticks on the horizontal axis and particular methods's detections in ten stream replications with black or red (in the case of the proposed approach) points. The ideal drift detection would result in vertical lines overlapping with ticks on x-axis.
  • Figure 5: Detection moments of all evaluated methods for streams with gradual concept drifts. The ticks on x-axis indicate the central point of each concept drift.
  • ...and 2 more figures