When, How Long and How Much? Interpretable Neural Networks for Time Series Regression by Learning to Mask and Aggregate

Florent Forest; Amaury Wei; Olga Fink

When, How Long and How Much? Interpretable Neural Networks for Time Series Regression by Learning to Mask and Aggregate

Florent Forest, Amaury Wei, Olga Fink

TL;DR

This work tackles the interpretability gap in time series extrinsic regression by introducing MAGNETS, a mask-and-aggregate neural architecture that discovers concepts without supervision. MAGNETS identifies input-dependent temporal regions via masks, aggregates them into a compact set of interpretable concepts, and combines them linearly to predict the target, enabling faithful, transparent reasoning. Across synthetic and real-world datasets, MAGNETS matches or exceeds interpretable baselines and remains competitive with black-box models, while delivering explanations that faithfully reflect temporal localization and multivariate interactions. This approach advances trustworthy TSER by providing both accurate predictions and easily inspectable, concept-based reasoning paths.

Abstract

Time series extrinsic regression (TSER) refers to the task of predicting a continuous target variable from an input time series. It appears in many domains, including healthcare, finance, environmental monitoring, and engineering. In these settings, accurate predictions and trustworthy reasoning are both essential. Although state-of-the-art TSER models achieve strong predictive performance, they typically operate as black boxes, making it difficult to understand which temporal patterns drive their decisions. Post-hoc interpretability techniques, such as feature attribution, aim to to explain how the model arrives at its predictions, but often produce coarse, noisy, or unstable explanations. Recently, inherently interpretable approaches based on concepts, additive decompositions, or symbolic regression, have emerged as promising alternatives. However, these approaches remain limited: they require explicit supervision on the concepts themselves, often cannot capture interactions between time-series features, lack expressiveness for complex temporal patterns, and struggle to scale to high-dimensional multivariate data. To address these limitations, we propose MAGNETS (Mask-and-AGgregate NEtwork for Time Series), an inherently interpretable neural architecture for TSER. MAGNETS learns a compact set of human-understandable concepts without requiring any annotations. Each concept corresponds to a learned, mask-based aggregation over selected input features, explicitly revealing both which features drive predictions and when they matter in the sequence. Predictions are formed as combinations of these learned concepts through a transparent, additive structure, enabling clear insight into the model's decision process.

When, How Long and How Much? Interpretable Neural Networks for Time Series Regression by Learning to Mask and Aggregate

TL;DR

Abstract

When, How Long and How Much? Interpretable Neural Networks for Time Series Regression by Learning to Mask and Aggregate

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)