Table of Contents
Fetching ...

Sonnet: Spectral Operator Neural Network for Multivariable Time Series Forecasting

Yuxuan Shu, Vasileios Lampos

TL;DR

Sonnet introduces a spectral-operator approach to multivariable time series forecasting by combining learnable wavelet transforms with a frequency-domain Multivariable Coherence Attention (MVCA) to capture inter- and intra-variable dependencies. It further stabilizes temporal evolution through a Koopman operator and reconstructs forecasts via a wavelet-based decoder. Across 12 real-world datasets, Sonnet achieves the best performance on 34 of 47 tasks and yields notable MAE reductions, with MVCA alone delivering substantial improvements on challenging tasks like ILI forecasting. The work demonstrates that replacing naïve attention with MVCA, particularly in a spectral context, provides robust gains across diverse domains, though it remains empirically validated rather than theoretically proven.

Abstract

Multivariable time series forecasting methods can integrate information from exogenous variables, leading to significant prediction accuracy gains. The transformer architecture has been widely applied in various time series forecasting models due to its ability to capture long-range sequential dependencies. However, a naïve application of transformers often struggles to effectively model complex relationships among variables over time. To mitigate against this, we propose a novel architecture, termed Spectral Operator Neural Network (Sonnet). Sonnet applies learnable wavelet transformations to the input and incorporates spectral analysis using the Koopman operator. Its predictive skill relies on the Multivariable Coherence Attention (MVCA), an operation that leverages spectral coherence to model variable dependencies. Our empirical analysis shows that Sonnet yields the best performance on $34$ out of $47$ forecasting tasks with an average mean absolute error (MAE) reduction of $2.2\%$ against the most competitive baseline. We further show that MVCA can remedy the deficiencies of naïve attention in various deep learning models, reducing MAE by $10.7\%$ on average in the most challenging forecasting tasks.

Sonnet: Spectral Operator Neural Network for Multivariable Time Series Forecasting

TL;DR

Sonnet introduces a spectral-operator approach to multivariable time series forecasting by combining learnable wavelet transforms with a frequency-domain Multivariable Coherence Attention (MVCA) to capture inter- and intra-variable dependencies. It further stabilizes temporal evolution through a Koopman operator and reconstructs forecasts via a wavelet-based decoder. Across 12 real-world datasets, Sonnet achieves the best performance on 34 of 47 tasks and yields notable MAE reductions, with MVCA alone delivering substantial improvements on challenging tasks like ILI forecasting. The work demonstrates that replacing naïve attention with MVCA, particularly in a spectral context, provides robust gains across diverse domains, though it remains empirically validated rather than theoretically proven.

Abstract

Multivariable time series forecasting methods can integrate information from exogenous variables, leading to significant prediction accuracy gains. The transformer architecture has been widely applied in various time series forecasting models due to its ability to capture long-range sequential dependencies. However, a naïve application of transformers often struggles to effectively model complex relationships among variables over time. To mitigate against this, we propose a novel architecture, termed Spectral Operator Neural Network (Sonnet). Sonnet applies learnable wavelet transformations to the input and incorporates spectral analysis using the Koopman operator. Its predictive skill relies on the Multivariable Coherence Attention (MVCA), an operation that leverages spectral coherence to model variable dependencies. Our empirical analysis shows that Sonnet yields the best performance on out of forecasting tasks with an average mean absolute error (MAE) reduction of against the most competitive baseline. We further show that MVCA can remedy the deficiencies of naïve attention in various deep learning models, reducing MAE by on average in the most challenging forecasting tasks.

Paper Structure

This paper contains 46 sections, 4 equations, 13 figures, 16 tables.

Figures (13)

  • Figure S1: Computational cost (GPU memory) of Sonnet regarding the hidden dimension size $d$, the number of atoms $K$, the number of exogenous variables $C$, and the look back window length $L$.
  • Figure S2: 7-day ahead forecasts for all influenza seasons and models, England (ILI-ENG).
  • Figure S4: 21-day ahead forecasts for all influenza seasons and models, England (ILI-ENG).
  • Figure S6: 7-day ahead forecasts for all influenza seasons and models, US Region 2 (ILI-US2).
  • Figure S8: 21-day ahead forecasts for all influenza seasons and models, US Region 2 (ILI-US2).
  • ...and 8 more figures