Unlocking Multi-Task Electric Energy System Intelligence: Data Scaling Laws and Performance with Limited Fine-Tuning

Shaohuai Liu; Lin Dong; Chao Tian; Le Xie

Unlocking Multi-Task Electric Energy System Intelligence: Data Scaling Laws and Performance with Limited Fine-Tuning

Shaohuai Liu, Lin Dong, Chao Tian, Le Xie

TL;DR

This work addresses the challenge of creating power-system foundation models that generalize across tasks and unseen operational scenarios. It develops a data-centric approach, demonstrating that scenario-generalization performance scales approximately as a power law with the amount of fine-tuning data, and that multi-task training preserves gains with limited interference. The study further shows that small models can achieve strong results and that parameter scaling yields limited benefits in this domain, highlighting data quality and task design as primary drivers of performance. Overall, the findings suggest data-efficient pathways to deploy robust, multi-task, cross-timescale AI for power systems, even with synthetic data and single-topology focus.

Abstract

Data scaling has revolutionized research fields like natural language processing, computer vision, and robotics control, providing foundation models with remarkable multi-task and generalization capabilities. In this paper, we investigate whether similar data scaling laws exist in developing foundation models for power systems, and whether appropriate data scaling can yield multi-task, cross-timescales capabilities that can be deployed in \textit{unseen} operational scenarios. To this end, we conducted a comprehensive empirical study on data scaling by fine-tuning open-source foundation models using labeled data collected from diverse operational tasks and scenarios. We study how a foundation model's scenario generalization performance evolves with the number of training tasks, scenarios, and demonstrations. Our study involved collecting more than 450k demonstrations and implementing independent tests under a rigorous evaluation framework. Our findings reveal several key insights: First, the generalization performance of a fine-tuned foundation model follows an approximate power-law relationship with the number of demonstrations and scenarios. Second, the fine-tuned model also demonstrates impressive multi-task capabilities, where multi-task training shares similar performance improvements with single-task training as the number of demonstrations increases, without interference among tasks. Lastly, models with small parameter sizes could have strong performance as well. Model performance does not scale significantly with parameter size. These findings underscore the feasibility of developing multi-task foundation models tailored for power systems, demonstrating that while larger datasets and models generally improve performance, extreme scaling is unnecessary to achieve satisfactory outcomes.

Unlocking Multi-Task Electric Energy System Intelligence: Data Scaling Laws and Performance with Limited Fine-Tuning

TL;DR

Abstract

Unlocking Multi-Task Electric Energy System Intelligence: Data Scaling Laws and Performance with Limited Fine-Tuning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)