Reinforcement Learning for Distributed Transient Frequency Control with Stability and Safety Guarantees
Zhenyi Yuan, Changhong Zhao, Jorge Cortes
TL;DR
The paper tackles transient frequency control in power networks with disturbances, aiming to maintain frequencies within safe bounds while ensuring asymptotic stability. It combines Lyapunov-based safety constraints with reinforcement learning, introducing a distributed dynamic budget mechanism that yields a provably safe and flexible policy search space. Neural networks parameterize class-$\mathcal{K}$ functions and frequency thresholds, and an RNN-based RL algorithm learns the optimal distributed policy (RLb) within this space. Case studies on the IEEE 39-bus network demonstrate guaranteed stability and transient safety, with improved optimality and robustness compared to baselines, highlighting the practical impact for secure, data-driven grid operation.
Abstract
This paper proposes a reinforcement learning-based approach for optimal transient frequency control in power systems with stability and safety guarantees. Building on Lyapunov stability theory and safety-critical control, we derive sufficient conditions on the distributed controller design that ensure the stability and transient frequency safety of the closed-loop system. Our idea of distributed dynamic budget assignment makes these conditions less conservative than those in recent literature, so that they can impose less stringent restrictions on the search space of control policies. We construct neural network controllers that parameterize such control policies and use reinforcement learning to train an optimal one. Simulations on the IEEE 39-bus network illustrate the guaranteed stability and safety properties of the controller along with its significantly improved optimality.
