AALF: Almost Always Linear Forecasting
Matthias Jakobs, Thomas Liebig
TL;DR
AALF addresses the tension between interpretability and predictive accuracy in time-series forecasting by online model selection between an autoregressive forecaster and a deep-learning model at each time step, guided by a learnable binary classifier under an interpretability constraint $B$ (i.e., using the interpretable model at least $B$ times). The framework derives the optimal per-step choice using a loss difference $\\ell(t)$ and trains a classifier on features that summarize model disagreement and history to approximate this optimal policy. Empirically, AALF achieves competitive RMSE/SMAPE with significantly greater interpretability across 6 real-world datasets and 3500+ time series, outperforming or matching state-of-the-art online selectors for many configurations, particularly at moderate interpretability levels ($p=B/T$). The approach is generic, scalable, and extensible, with future directions including multivariate extensions and constraint-guaranteed strategies, offering practical impact for safer, auditable forecasting in high-stakes settings.
Abstract
Recent works for time-series forecasting more and more leverage the high predictive power of Deep Learning models. With this increase in model complexity, however, comes a lack in understanding of the underlying model decision process, which is problematic for high-stakes application scenarios. At the same time, simple, interpretable forecasting methods such as ARIMA still perform very well, sometimes on-par, with Deep Learning approaches. We argue that simple models are good enough most of the time, and that forecasting performance could be improved by choosing a Deep Learning method only for few, important predictions, increasing the overall interpretability of the forecasting process. In this context, we propose a novel online model selection framework which learns to identify these predictions. An extensive empirical study on various real-world datasets shows that our selection methodology performs comparable to state-of-the-art online model selections methods in most cases while being significantly more interpretable. We find that almost always choosing a simple autoregressive linear model for forecasting results in competitive performance, suggesting that the need for opaque black-box models in time-series forecasting might be smaller than recent works would suggest.
