FocusLearn: Fully-Interpretable, High-Performance Modular Neural Networks for Time Series
Qiqi Su, Christos Kloukinas, Artur d'Avila Garcez
TL;DR
FocusLearn tackles the interpretability gap in time-series modeling by integrating an RNN-based temporal encoder with an attention-driven feature selector and a bank of modular neural networks. The architecture yields additive, feature-wise explanations while achieving predictive performance on par with top non-interpretable methods like LSTM and XGBoost, and outperforming interpretable baselines NAM and SPAM. Key innovations include Attention-Based Feature Selection (AFS) and Attention-Based Node Bootstrapping (ANB), which guide and weight modular units to produce faithful, NAM-like explanations. The approach offers practical impact by delivering transparent predictions and reliable explanations for complex multivariate time-series tasks, with strong empirical results across multiple datasets and tasks.
Abstract
Multivariate time series have many applications, from healthcare and meteorology to life science. Although deep learning models have shown excellent predictive performance for time series, they have been criticised for being "black-boxes" or non-interpretable. This paper proposes a novel modular neural network model for multivariate time series prediction that is interpretable by construction. A recurrent neural network learns the temporal dependencies in the data while an attention-based feature selection component selects the most relevant features and suppresses redundant features used in the learning of the temporal dependencies. A modular deep network is trained from the selected features independently to show the users how features influence outcomes, making the model interpretable. Experimental results show that this approach can outperform state-of-the-art interpretable Neural Additive Models (NAM) and variations thereof in both regression and classification of time series tasks, achieving a predictive performance that is comparable to the top non-interpretable methods for time series, LSTM and XGBoost.
