Reinforcement Learning Optimizes Power Dispatch in Decentralized Power Grid
Yongsun Lee, Hoyun Choi, Laurent Pagnier, Cook Hyun Kim, Jongshin Lee, Bukyoung Jhun, Heetae Kim, Juergen Kurths, B. Kahng
TL;DR
The paper addresses frequency stability in decentralized power grids with high renewable penetration by introducing GC-PPO, a graph-convolutional proximal policy optimization framework that outputs a distributed dispatch plan to minimize frequency fluctuations modeled by the swing equation. It combines graph neural networks with PPO to produce bus-level dispatch fractions $\delta P_{ji}$ in response to perturbations, and demonstrates superior performance over heuristic methods on SHK-like synthetic grids and a Kron-reduced UK grid, with stability measured by the fluctuation metric $\Xi$. The study highlights the importance of topology-aware, inertia-weighted dispatch in heterogeneous grids and discusses extensions to topology switching and proactive fault handling, offering a scalable path toward robust, decentralized frequency control. The approach provides a practical framework for rapid, topology-adaptive stabilization in modern grids with distributed renewable generation, potentially reducing blackout risk and improving reliability.
Abstract
Effective frequency control in power grids has become increasingly important with the increasing demand for renewable energy sources. Here, we propose a novel strategy for resolving this challenge using graph convolutional proximal policy optimization (GC-PPO). The GC-PPO method can optimally determine how much power individual buses dispatch to reduce frequency fluctuations across a power grid. We demonstrate its efficacy in controlling disturbances by applying the GC-PPO to the power grid of the UK. The performance of GC-PPO is outstanding compared to the classical methods. This result highlights the promising role of GC-PPO in enhancing the stability and reliability of power systems by switching lines or decentralizing grid topology.
