Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting

Liran Nochumsohn; Raz Marshanski; Hedi Zisling; Omri Azencot

Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting

Liran Nochumsohn, Raz Marshanski, Hedi Zisling, Omri Azencot

TL;DR

Super-Linear targets efficient generalization for time series forecasting by marrying frequency-specialized linear experts with a lightweight spectral gating mechanism. By pretraining a diverse set of univariate linear experts on resampled data across multiple frequencies and training a sparse router to select relevant experts, the model achieves strong zero-shot and full-shot performance while dramatically reducing parameters and inference time. The work provides theoretical bias–variance insights for the gating mechanism and demonstrates robust generalization across diverse benchmarks and sampling rates, with interpretable expert activations linked to data frequency. Overall, Super-Linear offers a practical, scalable alternative to large Transformer-based TSF foundations, delivering competitive accuracy with substantial efficiency gains and interpretability.

Abstract

Time series forecasting (TSF) is critical in domains like energy, finance, healthcare, and logistics, requiring models that generalize across diverse datasets. Large pre-trained models such as Chronos and Time-MoE show strong zero-shot (ZS) performance but suffer from high computational costs. In this work, we introduce Super-Linear, a lightweight and scalable mixture-of-experts (MoE) model for general forecasting. It replaces deep architectures with simple frequency-specialized linear experts, trained on resampled data across multiple frequency regimes. A lightweight spectral gating mechanism dynamically selects relevant experts, enabling efficient, accurate forecasting. Despite its simplicity, Super-Linear demonstrates strong performance across benchmarks, while substantially improving efficiency, robustness to sampling rates, and interpretability. The implementation of Super-Linear is available at: \href{https://github.com/azencot-group/SuperLinear}{https://github.com/azencot-group/SuperLinear}.

Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting

TL;DR

Abstract

Super-Linear: A Lightweight Pretrained Mixture of Linear Experts for Time Series Forecasting

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)