Deep Reinforcement Learning for Radiative Heat Transfer Optimization Problems
Eva Ortiz-Mansilla, Juan José García-Esteban, Jorge Bravo-Abad, Juan Carlos Cuevas
TL;DR
This work shows that reinforcement learning can be effectively applied to optimization problems in radiative heat transfer, using near-field transfer between multilayer hyperbolic metamaterials as a test case. By formulating layer configurations as sequential decisions, the authors compare a suite of RL algorithms (including SARSA, Double DQN, REINFORCE, A2C, and PPO) and demonstrate that Double DQN offers the best sample efficiency while PPO delivers robust performance with fewer explored states. The results reveal that RL can surpass physically intuitive baselines, achieving up to ~21% higher HTC for 16-layer stacks and scalable gains to 24-layer configurations, thereby providing a practical toolkit for optimization and inverse design in radiative heat transfer. The authors also provide public code to facilitate applying these RL methods to similar thermal-radiation problems and outline guidance for selecting algorithms based on problem characteristics.
Abstract
Reinforcement learning is a subfield of machine learning that is having a huge impact in the different conventional disciplines, including physical sciences. Here, we show how reinforcement learning methods can be applied to solve optimization problems in the context of radiative heat transfer. We illustrate their use with the optimization of the near-field radiative heat transfer between multilayer hyperbolic metamaterials. Specifically, we show how this problem can be formulated in the language of reinforcement learning and tackled with a variety of algorithms. We show that these algorithms allow us to find solutions that outperform those obtained using physical intuition. Overall, our work shows the power and potential of reinforcement learning methods for the investigation of a wide variety of problems in the context of radiative heat transfer and related topics.
