Multi-agent Deep Reinforcement Learning for Distributed Load Restoration

Linh Vu; Tuyen Vu; Thanh-Long Vu; Anurag Srivastava

Multi-agent Deep Reinforcement Learning for Distributed Load Restoration

Linh Vu, Tuyen Vu, Thanh-Long Vu, Anurag Srivastava

TL;DR

This paper addresses load restoration after outages in distribution systems modeled as networked microgrids by proposing a multi-agent deep reinforcement learning framework with invalid action masking. A centralized-training, decentralized-execution architecture is used, where multiple DQN agents control individual microgrids and cooperate via a shared reward to maximize restored load under constraints expressed as $\sum_i \sum_j n_i P_{ij} w_{ij}$ with $V^{min}\le V_u\le V^{max}$, generator limits, and line-flow constraints. The key contributions include the first MARL approach to load restoration, the invalid action masking mechanism to ensure safety and manage large action spaces, and extensive validation on IEEE 13-, 123-, and 8500-node feeders showing faster learning and high restoration percentages (up to $98.6\%$, $96.04\%$, and $86.96\%$ of available generation, respectively). The results demonstrate improved learning stability and performance over single-agent baselines, while limitations such as topology changes requiring retraining and training-time growth with the number of agents are identified, with future work aimed at generalization and parallel training.

Abstract

This paper addresses the load restoration problem after power outage events. Our primary proposed methodology is using multi-agent deep reinforcement learning to optimize the load restoration process in distribution systems, modeled as networked microgrids, via determining the optimal operational sequence of circuit breakers (switches). An innovative invalid action masking technique is incorporated into the multi-agent method to handle both the physical constraints in the restoration process and the curse of dimensionality as the action space of operational decisions grows exponentially with the number of circuit breakers. The features of our proposed method include centralized training for multi-agents to overcome non-stationary environment problems, decentralized execution to ease the deployment, and zero constraint violations to prevent harmful actions. Our simulations are performed in OpenDSS and Python environments to demonstrate the effectiveness of the proposed approach using the IEEE 13, 123, and 8500-node distribution test feeders. The results show that the proposed algorithm can achieve a significantly better learning curve and stability than the conventional methods.

Multi-agent Deep Reinforcement Learning for Distributed Load Restoration

TL;DR

with

, generator limits, and line-flow constraints. The key contributions include the first MARL approach to load restoration, the invalid action masking mechanism to ensure safety and manage large action spaces, and extensive validation on IEEE 13-, 123-, and 8500-node feeders showing faster learning and high restoration percentages (up to

, and

of available generation, respectively). The results demonstrate improved learning stability and performance over single-agent baselines, while limitations such as topology changes requiring retraining and training-time growth with the number of agents are identified, with future work aimed at generalization and parallel training.

Abstract

Paper Structure (15 sections, 6 equations, 19 figures, 1 table, 1 algorithm)

This paper contains 15 sections, 6 equations, 19 figures, 1 table, 1 algorithm.

Introduction
Multi-agent Reinforcement Learning for Load Restoration
Load Restoration Problem
Multi-agent DQN Formulation for Load Restoration
Epsilon-greedy Method
Invalid Action Masking Technique
Case Studies
IEEE 13 Nodes
IEEE 123 Nodes
IEEE 8500 Nodes
Comparative Analyses
Single-agent and multi-agent load restoration
With and without invalid action masking technique
Limitations
Conclusion

Figures (19)

Figure 1: The concept of resiliency curve during an extreme event.9205672
Figure 2: The overall diagram of multi-agent DRL framework.
Figure 3: An example of an agent's ANN structure within a multi-agent framework.
Figure 4: The training process of an agent within a multi-agent framework.
Figure 5: Visualization of action masking.
...and 14 more figures

Multi-agent Deep Reinforcement Learning for Distributed Load Restoration

TL;DR

Abstract

Multi-agent Deep Reinforcement Learning for Distributed Load Restoration

Authors

TL;DR

Abstract

Table of Contents

Figures (19)