N-BEATS: Neural basis expansion analysis for interpretable time series forecasting
Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, Yoshua Bengio
TL;DR
N-BEATS introduces a pure deep learning architecture for univariate time series forecasting that uses backward/forward residual blocks and a deep FC stack to achieve state-of-the-art results on M3, M4, and Tourism datasets without time-series-specific features. It provides two configurations—generic and interpretable—where the latter imposes trend and seasonality inductive biases via polynomial and Fourier bases to yield decomposed, human-understandable forecasts. The architecture supports extensive ensembling and multi-horizon training, and is framed within a meta-learning perspective to explain its generalization and transfer capabilities. Overall, the work demonstrates that deep learning can outperform traditional statistical methods in broad TS forecasting tasks and that interpretability can be achieved without sacrificing accuracy, with practical implications for forecasting practice and research into meta-learning analogies.
Abstract
We focus on solving the univariate times series point forecasting problem using deep learning. We propose a deep neural architecture based on backward and forward residual links and a very deep stack of fully-connected layers. The architecture has a number of desirable properties, being interpretable, applicable without modification to a wide array of target domains, and fast to train. We test the proposed architecture on several well-known datasets, including M3, M4 and TOURISM competition datasets containing time series from diverse domains. We demonstrate state-of-the-art performance for two configurations of N-BEATS for all the datasets, improving forecast accuracy by 11% over a statistical benchmark and by 3% over last year's winner of the M4 competition, a domain-adjusted hand-crafted hybrid between neural network and statistical time series models. The first configuration of our model does not employ any time-series-specific components and its performance on heterogeneous datasets strongly suggests that, contrarily to received wisdom, deep learning primitives such as residual blocks are by themselves sufficient to solve a wide range of forecasting problems. Finally, we demonstrate how the proposed architecture can be augmented to provide outputs that are interpretable without considerable loss in accuracy.
