Exploring the Capabilities and Limitations of Large Language Models in the Electric Energy Sector
Subir Majumder, Lin Dong, Fatemeh Doudi, Yuting Cai, Chao Tian, Dileep Kalathi, Kevin Ding, Anupam A. Thatte, Na Li, Le Xie
TL;DR
The paper investigates how large language models can assist the electric energy sector by evaluating their capabilities and limitations across tasks such as load forecasting, power-flow analysis, and document QA. It leverages in-weight learning, prompt engineering, tool embedding, and retrieval-augmented generation to demonstrate practical workflows, including multi-modal analyses and integration with power-system tools. The study identifies data, safety, physics-alignment, and cybersecurity as central challenges, and proposes concrete future directions—curated domain data, tool embeddings, robust knowledge bases, and uncertainty quantification—to enable LLMs as decision-support copilots in power systems. The findings highlight the potential for LLMs to augment human operators while emphasizing governance and physics-based constraints to ensure safe, reliable operation of electric infrastructure.
Abstract
Large Language Models (LLMs) as chatbots have drawn remarkable attention thanks to their versatile capability in natural language processing as well as in a wide range of tasks. While there has been great enthusiasm towards adopting such foundational model-based artificial intelligence tools in all sectors possible, the capabilities and limitations of such LLMs in improving the operation of the electric energy sector need to be explored, and this article identifies fruitful directions in this regard. Key future research directions include data collection systems for fine-tuning LLMs, embedding power system-specific tools in the LLMs, and retrieval augmented generation (RAG)-based knowledge pool to improve the quality of LLM responses and LLMs in safety-critical use cases.
