Addressing Heterogeneity in Federated Load Forecasting with Personalization Layers
Shourya Bose, Yu Zhang, Kibaek Kim
TL;DR
The paper tackles privacy-preserving short-term load forecasting in federated settings, where data heterogeneity across clients degrades standard FL performance. It introduces personalization layers (PL-FL), which partition model parameters into shared and personalized subsets so that only shared parameters are communicated, reducing bandwidth and better accommodating non-i.i.d data. Through an LSTM-based forecasting model evaluated on the NREL ComStock dataset across three regions, PL-FL with a personalized MLP and Adam optimizer achieves the best accuracy while lowering communication compared to conventional FL, albeit with some datasets remaining challenging. The results suggest PL-FL as a practical, bandwidth-efficient approach for privacy-conscious smart grid forecasting, with future work focusing on distributed inference for real-world deployment on edge devices.
Abstract
The advent of smart meters has enabled pervasive collection of energy consumption data for training short-term load forecasting models. In response to privacy concerns, federated learning (FL) has been proposed as a privacy-preserving approach for training, but the quality of trained models degrades as client data becomes heterogeneous. In this paper we propose the use of personalization layers for load forecasting in a general framework called PL-FL. We show that PL-FL outperforms FL and purely local training, while requiring lower communication bandwidth than FL. This is done through extensive simulations on three different datasets from the NREL ComStock repository.
