Techno-economic optimization of a heat-pipe microreactor, part I: theory and cost optimization
Paul Seurin, Dean Price, Luis Nunez
TL;DR
This paper tackles the economic viability of heat-pipe microreactors (HPMRs) by introducing a unifying framework that blends surrogate models with reinforcement learning to minimize the FOAK LCOE under safety and performance constraints. Using OpenMC for neutronics and MOUSE COA for techno-economic analysis, the authors explore a seven-parameter design space around a nominal HPMR and demonstrate that O&M and capital costs—especially axial reflector and control-drum materials—are dominant cost drivers. Two cost-sensitivity cases (Be reflector expensive vs graphite-like cheaper) show substantial LCOE reductions (≈57% and ≈78%, respectively) through design adjustments such as altering the axial reflector thickness and moderator size. Surrogate models (Gaussian processes and MLPs) achieve strong predictive accuracy, enabling efficient RL-guided optimization, with a roadmap for integrating full fuel and HP performance in a companion multi-objective study. This work provides a first-of-its-kind methodology to inform early-stage, economically informed HPMR design iterations and highlights key paths to improve competitiveness in remote-energy applications.
Abstract
Microreactors, particularly heat-pipe microreactors (HPMRs), are compact, transportable, self-regulated power systems well-suited for access-challenged remote areas where costly fossil fuels dominate. However, they suffer from diseconomies of scale, and their financial viability remains unconvincing. One step in addressing this shortcoming is to design these reactors with comprehensive economic and physics analyses informing early-stage design iteration. In this work, we present a novel unifying geometric design optimization approach that accounts for techno-economic considerations. We start by generating random samples to train surrogate models, including Gaussian processes (GPs) and multi-layer perceptrons (MLPs). We then deploy these surrogates within a reinforcement learning (RL)-based optimization framework to optimize the levelized cost of electricity (LCOE), all the while imposing constraints on the fuel lifetime, shutdown margin (SDM), peak heat flux, and rod-integrated peaking factor. We study two cases: one in which the axial reflector cost is very high, and one in which it is inexpensive. We found that the operation and maintenance and capital costs are the primary contributors to the overall LCOE particularly the cost of the axial reflectors (for the first case) and the control drum materials. The optimizer cleverly changes the design parameters so as to minimize one of them while still satisfying the constraints, ultimately reducing the LCOE by more than 57% in both instances. A comprehensive integration of fuel and HP performance with multi-objective optimization is currently being pursued to fully understand the interaction between constraints and cost performance.
