Adaptive Nonlinear Vector Autoregression: Robust Forecasting for Noisy Chaotic Time Series
Azimov Sherkhon, Susana Lopez-Moreno, Eric Dolores-Cuenca, Sieun Lee, Sangil Kim
TL;DR
Adaptive NVAR replaces fixed nonlinear feature bases with a trainable, shallow MLP to learn data-driven nonlinearities while jointly optimizing the linear readout. By incorporating a skip-connection and end-to-end gradient-based training, it achieves robust forecasting of chaotic time series under noise and scales to high-dimensional systems without explicit large-matrix inversions. Across Mackey–Glass, Lorenz–63, and Lorenz–96, it outperforms standard NVAR, leaky ESN, and Hybrid ESN, with particular strength in noisy and high-dimensional regimes. The approach enables GPU-accelerated training and demonstrates practical applicability to geophysical and other complex time-series forecasting tasks.
Abstract
Nonlinear vector autoregression (NVAR) and reservoir computing (RC) have shown promise in forecasting chaotic dynamical systems, such as the Lorenz-63 model and El Nino-Southern Oscillation. However, their reliance on fixed nonlinear transformations - polynomial expansions in NVAR or random feature maps in RC - limits their adaptability to high noise or complex real-world data. Furthermore, these methods also exhibit poor scalability in high-dimensional settings due to costly matrix inversion during optimization. We propose a data-adaptive NVAR model that combines delay-embedded linear inputs with features generated by a shallow, trainable multilayer perceptron (MLP). Unlike standard NVAR and RC models, the MLP and linear readout are jointly trained using gradient-based optimization, enabling the model to learn data-driven nonlinearities, while preserving a simple readout structure and improving scalability. Initial experiments across multiple chaotic systems, tested under noise-free and synthetically noisy conditions, showed that the adaptive model outperformed in predictive accuracy the standard NVAR, a leaky echo state network (ESN) - the most common RC model - and a hybrid ESN, thereby showing robust forecasting under noisy conditions.
