M$^2$OE$^2$-GL: A Family of Probabilistic Load Forecasters That Scales to Massive Customers
Haoran Li, Zhe Cheng, Muhao Guo, Yang Weng, Yannan Sun, Victor Tran, John Chainaranont
TL;DR
The paper tackles scalable probabilistic load forecasting for thousands of heterogeneous customer groups by extending the M$^2$OE$^2$ backbone to a global-to-local framework. It pretrains a single global model $f_{oldsymbol{ heta_0}}$ on all data and uses lightweight LoRA-based adapters $oldsymbol{}_g$ to tailor output heads, forming a family of forecasts $f_{oldsymbol{ heta_0},oldsymbol{}_g}$. This approach maintains a compact per-group footprint while achieving substantial accuracy gains, demonstrated by 30–50% improvements over the base model on real feeder data. The method offers practical deployment advantages for utilities by enabling scalable, uncertainty-aware forecasts across massive numbers of loads with reduced storage and compute requirements.
Abstract
Probabilistic load forecasting is widely studied and underpins power system planning, operation, and risk-aware decision making. Deep learning forecasters have shown strong ability to capture complex temporal and contextual patterns, achieving substantial accuracy gains. However, at the scale of thousands or even hundreds of thousands of loads in large distribution feeders, a deployment dilemma emerges: training and maintaining one model per customer is computationally and storage intensive, while using a single global model ignores distributional shifts across customer types, locations, and phases. Prior work typically focuses on single-load forecasters, global models across multiple loads, or adaptive/personalized models for relatively small settings, and rarely addresses the combined challenges of heterogeneity and scalability in large feeders. We propose M2OE2-GL, a global-to-local extension of the M2OE2 probabilistic forecaster. We first pretrain a single global M2OE2 base model across all feeder loads, then apply lightweight fine-tuning to derive a compact family of group-specific forecasters. Evaluated on realistic utility data, M2OE2-GL yields substantial error reductions while remaining scalable to very large numbers of loads.
