Echo Flow Networks
Hongbo Liu, Jia Xu
TL;DR
Echo Flow Networks (EFNs) address the challenge of modeling long-range temporal dependencies in time-series forecasting with high efficiency. By extending ESNs into Group X-ESNs and introducing Matrix-Gated Composite Random Activation (MCRA), EFNs significantly expand reservoir expressiveness while maintaining linear training complexity; a dual-stream fusion with cross-attention combines long-horizon dynamics with short-context patterns. The framework achieves state-of-the-art results on multiple benchmarks (e.g., EchoFormer yielding substantial MSE reductions on DMV, ETTh, Weather, and Traffic datasets) and demonstrates improved training speed and reduced model size relative to leading methods like PatchTST. This approach offers a scalable, robust alternative for ultra-long sequence forecasting, suitable as a standalone predictor or as a boosters for existing TSF models.
Abstract
At the heart of time-series forecasting (TSF) lies a fundamental challenge: how can models efficiently and effectively capture long-range temporal dependencies across ever-growing sequences? While deep learning has brought notable progress, conventional architectures often face a trade-off between computational complexity and their ability to retain accumulative information over extended horizons. Echo State Networks (ESNs), a class of reservoir computing models, have recently regained attention for their exceptional efficiency, offering constant memory usage and per-step training complexity regardless of input length. This makes them particularly attractive for modeling extremely long-term event history in TSF. However, traditional ESNs fall short of state-of-the-art performance due to their limited nonlinear capacity, which constrains both their expressiveness and stability. We introduce Echo Flow Networks (EFNs), a framework composed of a group of extended Echo State Networks (X-ESNs) with MLP readouts, enhanced by our novel Matrix-Gated Composite Random Activation (MCRA), which enables complex, neuron-specific temporal dynamics, significantly expanding the network's representational capacity without compromising computational efficiency. In addition, we propose a dual-stream architecture in which recent input history dynamically selects signature reservoir features from an infinite-horizon memory, leading to improved prediction accuracy and long-term stability. Extensive evaluations on five benchmarks demonstrate that EFNs achieve up to 4x faster training and 3x smaller model size compared to leading methods like PatchTST, reducing forecasting error from 43% to 35%, a 20% relative improvement. One instantiation of our framework, EchoFormer, consistently achieves new state-of-the-art performance across five benchmark datasets: ETTh, ETTm, DMV, Weather, and Air Quality.
