Spectral State Space Models
Naman Agarwal, Daniel Suo, Xinyi Chen, Elad Hazan
TL;DR
The paper tackles the challenge of long-range sequence prediction by introducing Spectral State Space Models (SSMs), which use fixed spectral filters to capture long-range dependencies without reliance on the spectral gap or high dimensionality. It develops the Spectral Transform Unit (STU) and its stacked variants, combining a fixed spectral component with a small autoregressive portion to achieve stable, memory-efficient learning. The authors provide theoretical guarantees connecting spectral filtering to the expressive capacity of linear dynamical systems and validate the approach on synthetic data and the Long Range Arena benchmark, showing robustness and competitive performance without specialized initialization or normalization. Overall, the work offers a principled, computationally efficient alternative to Transformers for long-context modeling, with strong stability properties and practical applicability across modalities.
Abstract
This paper studies sequence modeling for prediction tasks with long range dependencies. We propose a new formulation for state space models (SSMs) based on learning linear dynamical systems with the spectral filtering algorithm (Hazan et al. (2017)). This gives rise to a novel sequence prediction architecture we call a spectral state space model. Spectral state space models have two primary advantages. First, they have provable robustness properties as their performance depends on neither the spectrum of the underlying dynamics nor the dimensionality of the problem. Second, these models are constructed with fixed convolutional filters that do not require learning while still outperforming SSMs in both theory and practice. The resulting models are evaluated on synthetic dynamical systems and long-range prediction tasks of various modalities. These evaluations support the theoretical benefits of spectral filtering for tasks requiring very long range memory.
