Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse, Felix Kleuker, Márton Fejér, Álvaro Serra-Gómez, Aske Plaat, Thomas Moerland
TL;DR
This survey addresses reinforcement learning for sustainable energy across the entire energy chain—generation, storage, consumption, and grids—arguing that RL's sequential decision-making capabilities can address intermittency, storage optimization, demand management, and grid stability. It provides an integrated taxonomy, surveys of RL basics, and a detailed review of applications by pillar, while highlighting overarching RL themes (multi-agent, offline, and safe RL) and the need for standardized benchmarks (e.g., SustainGym, CityLearn, L2RPN) to accelerate cross-disciplinary progress. The authors identify key challenges, including data scarcity, model uncertainty, safety constraints, and the gap between energy research and ML methodology, and propose future directions such as model-based and offline RL, graph-based approaches, and multi-objective/multi-scale methods. Overall, the paper argues that standardized benchmarks and closer collaboration between energy researchers and ML researchers are essential to realize RL's potential in enabling a scalable and reliable energy transition.
Abstract
The transition to sustainable energy is a key challenge of our time, requiring modifications in the entire pipeline of energy production, storage, transmission, and consumption. At every stage, new sequential decision-making challenges emerge, ranging from the operation of wind farms to the management of electrical grids or the scheduling of electric vehicle charging stations. All such problems are well suited for reinforcement learning, the branch of machine learning that learns behavior from data. Therefore, numerous studies have explored the use of reinforcement learning for sustainable energy. This paper surveys this literature with the intention of bridging both the underlying research communities: energy and machine learning. After a brief introduction of both fields, we systematically list relevant sustainability challenges, how they can be modeled as a reinforcement learning problem, and what solution approaches currently exist in the literature. Afterwards, we zoom out and identify overarching reinforcement learning themes that appear throughout sustainability, such as multi-agent, offline, and safe reinforcement learning. Lastly, we also cover standardization of environments, which will be crucial for connecting both research fields, and highlight potential directions for future work. In summary, this survey provides an extensive overview of reinforcement learning methods for sustainable energy, which may play a vital role in the energy transition.
