Improving the Evaluation and Actionability of Explanation Methods for Multivariate Time Series Classification
Davide Italo Serramazza, Thach Le Nguyen, Georgiana Ifrim
TL;DR
The paper tackles the challenge of evaluating and making explanations for MTSC actionable. It analyzes InterpretTime, identifies weaknesses such as reliance on data augmentation and single-mask perturbations, and proposes improvements including multiple masking and time-series chunking. Through ground-truth alignment and real-world datasets, it shows perturbation-based methods, particularly SHAP and Feature Ablation, offer strong explanatory power and that channel-level explanations can meaningfully guide MTSC channel selection with data reduction and improved accuracy. The work advances practical XAI for MTSC and provides a path toward more reliable, task-driven explanations with broader applicability beyond mere visualization.
Abstract
Explanation for Multivariate Time Series Classification (MTSC) is an important topic that is under explored. There are very few quantitative evaluation methodologies and even fewer examples of actionable explanation, where the explanation methods are shown to objectively improve specific computational tasks on time series data. In this paper we focus on analyzing InterpretTime, a recent evaluation methodology for attribution methods applied to MTSC. We showcase some significant weaknesses of the original methodology and propose ideas to improve both its accuracy and efficiency. Unlike related work, we go beyond evaluation and also showcase the actionability of the produced explainer ranking, by using the best attribution methods for the task of channel selection in MTSC. We find that perturbation-based methods such as SHAP and Feature Ablation work well across a set of datasets, classifiers and tasks and outperform gradient-based methods. We apply the best ranked explainers to channel selection for MTSC and show significant data size reduction and improved classifier accuracy.
