Deep Reinforcement Learning based Triggering Function for Early Classifiers of Time Series
Aurélien Renault, Alexis Bondu, Antoine Cornuéjols, Vincent Lemaire
TL;DR
This work addresses Early Classification of Time Series (ECTS) by translating the triggering decision problem into a Reinforcement Learning (RL) framework for separable architectures and introducing Alert, a Deep Q-Network based triggering function. By using the same feature sets as hand-crafted rules, the authors enable a fair comparison between man-tailored and RL-based triggering and demonstrate that Alert_star, a richer state-space variant, consistently outperforms state-of-the-art approaches across 31 datasets and a range of misclassification–delay costs. The results show that larger, well-chosen state spaces empower RL to learn more effective, non-linear triggering rules, though at the cost of interpretability. The study highlights the potential of RL to discover improved triggering policies for ECTS and suggests further work on explainability and broader state-space design to balance performance with transparency.
Abstract
Early Classification of Time Series (ECTS) has been recognized as an important problem in many areas where decisions have to be taken as soon as possible, before the full data availability, while time pressure increases. Numerous ECTS approaches have been proposed, based on different triggering functions, each taking into account various pieces of information related to the incoming time series and/or the output of a classifier. Although their performances have been empirically compared in the literature, no studies have been carried out on the optimality of these triggering functions that involve ``man-tailored'' decision rules. Based on the same information, could there be better triggering functions? This paper presents one way to investigate this question by showing first how to translate ECTS problems into Reinforcement Learning (RL) ones, where the very same information is used in the state space. A thorough comparison of the performance obtained by ``handmade'' approaches and their ``RL-based'' counterparts has been carried out. A second question investigated in this paper is whether a different combination of information, defining the state space in RL systems, can achieve even better performance. Experiments show that the system we describe, called \textsc{Alert}, significantly outperforms its state-of-the-art competitors on a large number of datasets.
