Concept Extraction for Time Series with ECLAD-ts
Antonia Holzapfel, Andres Felipe Posada-Moreno, Sebastian Trimpe
TL;DR
The paper tackles the challenge of explaining CNN-based time series classifiers by introducing ECLAD-ts, a post-hoc global Concept Extraction method tailored to multivariate time series. It builds on the ECLAD framework by encoding latent space with Local Aggregated Descriptors (LADs), clustering these representations to form concepts, and computing a channel-aware relevance score based on model gradients, enabling both temporal and channel-wise localization. Through synthetic datasets with predefined primitives and real-world datasets like GunPoint and P2S, the authors demonstrate that ECLAD-ts yields higher representation and importance correctness than baselines (ECLAD, ConceptShap, MultiVISION), and effectively reveals shortcut learning and the features driving predictions. The approach offers practical value for model transparency, enabling practitioners to inspect which patterns and channels most influence decisions, thereby improving trust, debugging, and deployment in safety-critical domains. It also highlights limitations and future directions, such as visualization challenges and hyperparameter sensitivity, guiding further refinement of time-series explainability methods.
Abstract
Convolutional neural networks (CNNs) for time series classification (TSC) are being increasingly used in applications ranging from quality prediction to medical diagnosis. The black box nature of these models makes understanding their prediction process difficult. This issue is crucial because CNNs are prone to learning shortcuts and biases, compromising their robustness and alignment with human expectations. To assess whether such mechanisms are being used and the associated risk, it is essential to provide model explanations that reflect the inner workings of the model. Concept Extraction (CE) methods offer such explanations, but have mostly been developed for the image domain so far, leaving a gap in the time series domain. In this work, we present a CE and localization method tailored to the time series domain, based on the ideas of CE methods for images. We propose the novel method ECLAD-ts, which provides post-hoc global explanations based on how the models encode subsets of the input at different levels of abstraction. For this, concepts are produced by clustering timestep-wise aggregations of CNN activation maps, and their importance is computed based on their impact on the prediction process. We evaluate our method on synthetic and natural datasets. Furthermore, we assess the advantages and limitations of CE in time series through empirical results. Our results show that ECLAD-ts effectively explains models by leveraging their internal representations, providing useful insights about their prediction process.
