Explainable Adaptive Tree-based Model Selection for Time Series Forecasting

Matthias Jakobs; Amal Saadallah

Explainable Adaptive Tree-based Model Selection for Time Series Forecasting

Matthias Jakobs, Amal Saadallah

TL;DR

This work proposes a novel method for the online selection of tree-based models using the TreeSHAP explainability method in the task of time series forecasting, and outlines a performance-based ranking with a coherent design to make TreeSHAP able to specialize the tree-based forecasters across different regions in the input time series.

Abstract

Tree-based models have been successfully applied to a wide variety of tasks, including time series forecasting. They are increasingly in demand and widely accepted because of their comparatively high level of interpretability. However, many of them suffer from the overfitting problem, which limits their application in real-world decision-making. This problem becomes even more severe in online-forecasting settings where time series observations are incrementally acquired, and the distributions from which they are drawn may keep changing over time. In this context, we propose a novel method for the online selection of tree-based models using the TreeSHAP explainability method in the task of time series forecasting. We start with an arbitrary set of different tree-based models. Then, we outline a performance-based ranking with a coherent design to make TreeSHAP able to specialize the tree-based forecasters across different regions in the input time series. In this framework, adequate model selection is performed online, adaptively following drift detection in the time series. In addition, explainability is supported on three levels, namely online input importance, model selection, and model output explanation. An extensive empirical study on various real-world datasets demonstrates that our method achieves excellent or on-par results in comparison to the state-of-the-art approaches as well as several baselines.

Explainable Adaptive Tree-based Model Selection for Time Series Forecasting

TL;DR

Abstract

Paper Structure (13 sections, 15 equations, 5 figures, 4 tables)

This paper contains 13 sections, 15 equations, 5 figures, 4 tables.

Introduction
Related Work
Methodology
Preliminaries
Shapley values and TreeSHAP
TreeSHAP Model Selection (TSMS)
Experiments
Experimental Setup
TSMS Setup and Baselines
Performance Comparison Results
Explainability
Discussion and future work
Concluding Remarks

Figures (5)

Figure 1: Visualization of A2 and A3 by investigating the closest and furthest RoC members, in addition to Shapley values and feature decision boundaries.
Figure 2: Comparison of Shapley values for prediction before and after concept drift where the chosen forecaster changes.
Figure 3: Visualization for $\mathcal{R}_{17}$
Figure 4: Comparison of $\mathcal{R}_{10}$ before and after a drift. Closest RoC member shown in orange and red, respectively.
Figure 5: Comparison of $\mathcal{R}_{13}$ and $\mathcal{R}_{14}$ before and after a drift. Closest RoC member shown in orange and red, respectively.

Theorems & Definitions (1)

Definition 1

Explainable Adaptive Tree-based Model Selection for Time Series Forecasting

TL;DR

Abstract

Explainable Adaptive Tree-based Model Selection for Time Series Forecasting

Authors

TL;DR

Abstract

Table of Contents

Figures (5)

Theorems & Definitions (1)