Multiobjective Hydropower Reservoir Operation Optimization with Transformer-Based Deep Reinforcement Learning

Rixin Wu; Ran Wang; Jie Hao; Qiang Wu; Ping Wang

Multiobjective Hydropower Reservoir Operation Optimization with Transformer-Based Deep Reinforcement Learning

Rixin Wu, Ran Wang, Jie Hao, Qiang Wu, Ping Wang

TL;DR

This work tackles multiobjective multireservoir operation by jointly optimizing power generation, ecological protection, and water supply. It introduces a transformer-based deep reinforcement learning (T-DRL) framework with a two-stage encoder to efficiently embed high-dimensional reservoir and residential-area information, and employs a decomposition strategy that converts the MMROO problem into 171 subproblems via weighted aggregation. Case studies on Lake Powell and Lake Mead show that the Two-stage T-DRL delivers 10.11% more electricity, reduces the AAPFD by 39.69%, and increases water-supply revenue by 4.10% relative to a state-of-the-art method. The results demonstrate the method’s effectiveness, scalability, and potential to improve hydropower reservoir management while safeguarding ecological health and water services.

Abstract

Due to shortage of water resources and increasing water demands, the joint operation of multireservoir systems for balancing power generation, ecological protection, and the residential water supply has become a critical issue in hydropower management. However, the numerous constraints and nonlinearity of multiple reservoirs make solving this problem time-consuming. To address this challenge, a deep reinforcement learning approach that incorporates a transformer framework is proposed. The multihead attention mechanism of the encoder effectively extracts information from reservoirs and residential areas, and the multireservoir attention network of the decoder generates suitable operational decisions. The proposed method is applied to Lake Mead and Lake Powell in the Colorado River Basin. The experimental results demonstrate that the transformer-based deep reinforcement learning approach can produce appropriate operational outcomes. Compared to a state-of-the-art method, the operation strategies produced by the proposed approach generate 10.11% more electricity, reduce the amended annual proportional flow deviation by 39.69%, and increase water supply revenue by 4.10%. Consequently, the proposed approach offers an effective method for the multiobjective operation of multihydropower reservoir systems.

Multiobjective Hydropower Reservoir Operation Optimization with Transformer-Based Deep Reinforcement Learning

TL;DR

Abstract

Paper Structure (27 sections, 11 equations, 11 figures, 4 tables, 1 algorithm)

This paper contains 27 sections, 11 equations, 11 figures, 4 tables, 1 algorithm.

Introduction
Related work
Problem statement
System model
Power generation
Ecological protection
Water supply
Problem formulation
Decision variables
Objective functions
Constraints
Methodology
General framework
Decomposition strategy
Encoder in the transformer model
...and 12 more sections

Figures (11)

Figure 1: An illustration of a multihydropower reservoir system
Figure 2: Framework of the transformer-based DRL method
Figure 3: The process of Embedding for power generation. (a) involves a two-stage learning progress, while (b) inputs all information directly to the encoder.
Figure 4: The process of Embedding for water supply. (a) involves a two-stage learning progress, while (b) inputs all information directly to the encoder.
Figure 5: Brief view of Lake Mead and Lake Powell
...and 6 more figures

Multiobjective Hydropower Reservoir Operation Optimization with Transformer-Based Deep Reinforcement Learning

TL;DR

Abstract

Multiobjective Hydropower Reservoir Operation Optimization with Transformer-Based Deep Reinforcement Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (11)