Design Optimization of Nuclear Fusion Reactor through Deep Reinforcement Learning
Jinsu Kim, Jaemin Seo
TL;DR
This work tackles the challenge of designing a steady-state tokamak reactor under multiple operational constraints (density, beta, kink instability, bootstrap fraction) while minimizing cost. It proposes a Deep Reinforcement Learning framework using Proximal Policy Optimization (PPO) to optimize reactor design by scalarizing multiple objectives into a reward signal, trained against a custom design computation environment. The approach demonstrates that DRL can find cost-reduced designs that satisfy all steady-state constraints, outperforming grid-search in efficiency and revealing multiple viable operating regimes; for example, DRL found a design with $Q\approx6.03$ while reference and grid-search designs reached higher $Q$ values, indicating trade-offs between cost and confinement. Overall, the framework offers a scalable, parallelizable method for multi-objective tokamak design optimization with potential for extension to material and profile-shape considerations, reducing computational costs for conceptual reactor design.
Abstract
This research explores the application of Deep Reinforcement Learning (DRL) to optimize the design of a nuclear fusion reactor. DRL can efficiently address the challenging issues attributed to multiple physics and engineering constraints for steady-state operation. The fusion reactor design computation and the optimization code applicable to parallelization with DRL are developed. The proposed framework enables finding the optimal reactor design that satisfies the operational requirements while reducing building costs. Multi-objective design optimization for a fusion reactor is now simplified by DRL, indicating the high potential of the proposed framework for advancing the efficient and sustainable design of future reactors.
