Table of Contents
Fetching ...

Echo Flow Networks

Hongbo Liu, Jia Xu

TL;DR

Echo Flow Networks (EFNs) address the challenge of modeling long-range temporal dependencies in time-series forecasting with high efficiency. By extending ESNs into Group X-ESNs and introducing Matrix-Gated Composite Random Activation (MCRA), EFNs significantly expand reservoir expressiveness while maintaining linear training complexity; a dual-stream fusion with cross-attention combines long-horizon dynamics with short-context patterns. The framework achieves state-of-the-art results on multiple benchmarks (e.g., EchoFormer yielding substantial MSE reductions on DMV, ETTh, Weather, and Traffic datasets) and demonstrates improved training speed and reduced model size relative to leading methods like PatchTST. This approach offers a scalable, robust alternative for ultra-long sequence forecasting, suitable as a standalone predictor or as a boosters for existing TSF models.

Abstract

At the heart of time-series forecasting (TSF) lies a fundamental challenge: how can models efficiently and effectively capture long-range temporal dependencies across ever-growing sequences? While deep learning has brought notable progress, conventional architectures often face a trade-off between computational complexity and their ability to retain accumulative information over extended horizons. Echo State Networks (ESNs), a class of reservoir computing models, have recently regained attention for their exceptional efficiency, offering constant memory usage and per-step training complexity regardless of input length. This makes them particularly attractive for modeling extremely long-term event history in TSF. However, traditional ESNs fall short of state-of-the-art performance due to their limited nonlinear capacity, which constrains both their expressiveness and stability. We introduce Echo Flow Networks (EFNs), a framework composed of a group of extended Echo State Networks (X-ESNs) with MLP readouts, enhanced by our novel Matrix-Gated Composite Random Activation (MCRA), which enables complex, neuron-specific temporal dynamics, significantly expanding the network's representational capacity without compromising computational efficiency. In addition, we propose a dual-stream architecture in which recent input history dynamically selects signature reservoir features from an infinite-horizon memory, leading to improved prediction accuracy and long-term stability. Extensive evaluations on five benchmarks demonstrate that EFNs achieve up to 4x faster training and 3x smaller model size compared to leading methods like PatchTST, reducing forecasting error from 43% to 35%, a 20% relative improvement. One instantiation of our framework, EchoFormer, consistently achieves new state-of-the-art performance across five benchmark datasets: ETTh, ETTm, DMV, Weather, and Air Quality.

Echo Flow Networks

TL;DR

Echo Flow Networks (EFNs) address the challenge of modeling long-range temporal dependencies in time-series forecasting with high efficiency. By extending ESNs into Group X-ESNs and introducing Matrix-Gated Composite Random Activation (MCRA), EFNs significantly expand reservoir expressiveness while maintaining linear training complexity; a dual-stream fusion with cross-attention combines long-horizon dynamics with short-context patterns. The framework achieves state-of-the-art results on multiple benchmarks (e.g., EchoFormer yielding substantial MSE reductions on DMV, ETTh, Weather, and Traffic datasets) and demonstrates improved training speed and reduced model size relative to leading methods like PatchTST. This approach offers a scalable, robust alternative for ultra-long sequence forecasting, suitable as a standalone predictor or as a boosters for existing TSF models.

Abstract

At the heart of time-series forecasting (TSF) lies a fundamental challenge: how can models efficiently and effectively capture long-range temporal dependencies across ever-growing sequences? While deep learning has brought notable progress, conventional architectures often face a trade-off between computational complexity and their ability to retain accumulative information over extended horizons. Echo State Networks (ESNs), a class of reservoir computing models, have recently regained attention for their exceptional efficiency, offering constant memory usage and per-step training complexity regardless of input length. This makes them particularly attractive for modeling extremely long-term event history in TSF. However, traditional ESNs fall short of state-of-the-art performance due to their limited nonlinear capacity, which constrains both their expressiveness and stability. We introduce Echo Flow Networks (EFNs), a framework composed of a group of extended Echo State Networks (X-ESNs) with MLP readouts, enhanced by our novel Matrix-Gated Composite Random Activation (MCRA), which enables complex, neuron-specific temporal dynamics, significantly expanding the network's representational capacity without compromising computational efficiency. In addition, we propose a dual-stream architecture in which recent input history dynamically selects signature reservoir features from an infinite-horizon memory, leading to improved prediction accuracy and long-term stability. Extensive evaluations on five benchmarks demonstrate that EFNs achieve up to 4x faster training and 3x smaller model size compared to leading methods like PatchTST, reducing forecasting error from 43% to 35%, a 20% relative improvement. One instantiation of our framework, EchoFormer, consistently achieves new state-of-the-art performance across five benchmark datasets: ETTh, ETTm, DMV, Weather, and Air Quality.

Paper Structure

This paper contains 40 sections, 13 equations, 16 figures, 11 tables, 1 algorithm.

Figures (16)

  • Figure 1: EFNs Framework with X-ESNs, and MCRA ($\mathbf{W}_1$,$\mathbf{W}_2$, $\sigma$).
  • Figure 2: MSE vs. different initializations. Group EFNs (e.g., 10 EFNs) improve stability and error rates over a single EFNs on ETTh1. EchoFormer and EchoSolo are not sensitive to initialization.
  • Figure 3: MSE vs. EFNs numbers. Optimal EFNs Numbers in Group EFNs converge around $10$ across datasets ETTh1, ETTh2, and Weather.
  • Figure 4: MSE vs. horizon length: EchoFormer outperforms the baseline across all horizons, with widening margins as the horizon extends, on validation set (horizon $720$ for all ablations by default).
  • Figure 5: Model training efficiency. EchoFormer and EchoMLP converge more quickly and achieve lower error rates than baselines on the ETTh1 validation set.
  • ...and 11 more figures