HUGO -- Highlighting Unseen Grid Options: Combining Deep Reinforcement Learning with a Heuristic Target Topology Approach
Malte Lehna, Clara Holzhüter, Sven Tomforde, Christoph Scholz
TL;DR
This work addresses grid control under high renewable penetration by moving from substation-level actions to holistic topology optimization via Target Topologies (TTs). It introduces a search method to identify robust TT configurations and upgrades the CurriculumAgent to a Topology Agent with a greedy TT-focused component, demonstrating significant improvements on the WCCI 2022 Grid2Op IEEE118 benchmark (mean score >10% higher; median survival ≈25% higher). The analysis reveals that most effective TT configurations remain close to the base topology, suggesting inherent robustness in near-base topologies. The results support adopting TT-based topology optimization as a practical, scalable enhancement for automated grid operation, with potential for integration into hierarchical control frameworks and future data reuse for training topology-specific agents.
Abstract
With the growth of Renewable Energy (RE) generation, the operation of power grids has become increasingly complex. One solution could be automated grid operation, where Deep Reinforcement Learning (DRL) has repeatedly shown significant potential in Learning to Run a Power Network (L2RPN) challenges. However, only individual actions at the substation level have been subjected to topology optimization by most existing DRL algorithms. In contrast, we propose a more holistic approach by proposing specific Target Topologies (TTs) as actions. These topologies are selected based on their robustness. As part of this paper, we present a search algorithm to find the TTs and upgrade our previously developed DRL agent CurriculumAgent (CAgent) to a novel topology agent. We compare the upgrade to the previous CAgent and can increase their L2RPN score significantly by 10%. Further, we achieve a 25% better median survival time with our TTs included. Later analysis shows that almost all TTs are close to the base topology, explaining their robustness
