Gradient-based Model Shortcut Detection for Time Series Classification
Salomon Ibarra, Frida Cantu, Kaixiong Zhou, Li Zhang
TL;DR
The paper addresses the problem of point-based shortcut learning in time series classification, showing that simple, spurious patterns can mislead deep models. It introduces the Shortcut Aggregate Gradient (SAG) score, a gradient-based method that aggregates input gradients from a pretrained model to detect class-level shortcuts without relying on test data or external attributes. Empirical results on UCR two-class datasets demonstrate high precision in identifying shortcut classes and datasets, with a clear demonstration on GunPoint and a SonyAIBO Robot Surface case study. The work provides a practical tool for diagnosing internal model biases in time series and offers a pathway toward more robust, reliable models in critical domains.
Abstract
Deep learning models have attracted lots of research attention in time series classification (TSC) task in the past two decades. Recently, deep neural networks (DNN) have surpassed classical distance-based methods and achieved state-of-the-art performance. Despite their promising performance, deep neural networks (DNNs) have been shown to rely on spurious correlations present in the training data, which can hinder generalization. For instance, a model might incorrectly associate the presence of grass with the label ``cat" if the training set have majority of cats lying in grassy backgrounds. However, the shortcut behavior of DNNs in time series remain under-explored. Most existing shortcut work are relying on external attributes such as gender, patients group, instead of focus on the internal bias behavior in time series models. In this paper, we take the first step to investigate and establish point-based shortcut learning behavior in deep learning time series classification. We further propose a simple detection method based on other class to detect shortcut occurs without relying on test data or clean training classes. We test our proposed method in UCR time series datasets.
