A Lightweight Multi-View Approach to Short-Term Load Forecasting
Julien Guité-Vinet, Alexandre Blondin Massé, Éric Beaudry
TL;DR
The paper addresses short-term load forecasting in settings with limited data and the risk of overfitting large models by introducing a lightweight multi-view framework. It combines single-value embeddings, a scaled time-range input of critical lags, and embedding dropout within a transformer-based encoder–decoder to fuse multiple exogenous views with minimal parameters. Empirically, the approach achieves competitive accuracy across four real-world datasets while reducing parameter counts substantially and providing interpretable insights into feature contributions. This yields robust, scalable forecasts suitable for practical deployment, with potential applicability to other time-series domains requiring efficient, explainable models.
Abstract
Time series forecasting is a critical task across domains such as energy, finance, and meteorology, where accurate predictions enable informed decision-making. While transformer-based and large-parameter models have recently achieved state-of-the-art results, their complexity can lead to overfitting and unstable forecasts, especially when older data points become less relevant. In this paper, we propose a lightweight multi-view approach to short-term load forecasting that leverages single-value embeddings and a scaled time-range input to capture temporally relevant features efficiently. We introduce an embedding dropout mechanism to prevent over-reliance on specific features and enhance interpretability. Our method achieves competitive performance with significantly fewer parameters, demonstrating robustness across multiple datasets, including scenarios with noisy or sparse data, and provides insights into the contributions of individual features to the forecast.
