ReactorFold: Generative discovery of nuclear reactor cores via emergent physical reasoning
Yoonpyo Lee
TL;DR
ReactorFold reframes nuclear reactor core design as a sequence-generation problem and demonstrates that a language-model–driven generator can discover feasible, high-performing, and asymmetric lattice topologies beyond traditional symmetry and fixed-parameter constraints. The method uses a three-stage curriculum (FFT, LoRA, DPO) with Monte Carlo-based physics feedback to produce designs that are validated by OpenMC, achieving a six-fold improvement in objective fitness over a fixed-inventory GA within 1,000 high-fidelity evaluations. Key insights include emergent constraint relaxation (modulating Gd inventory) and the discovery of design-space regions inaccessible to conventional search, supported by analyses showing decoupled objective correlations. The work suggests a new paradigm for accelerated, physics-informed inverse design in nuclear engineering and potentially other discrete-design domains, while noting limitations to 2D cores and the need for multi-physics integration in future work.
Abstract
Designing nuclear reactor cores requires navigating large discrete design spaces governed by complex neutronic interactions. Traditional deterministic, metaheuristic, and machine-learning-assisted methods search within fixed, human-defined configuration spaces, limiting their ability to discover fundamentally new design topologies. Here we introduce ReactorFold, a generative framework that reformulates fuel-assembly design as a sequence modeling problem for language models. Using Monte Carlo data, parameter-efficient fine-tuning, and Direct Preference Optimization (DPO), the model learns the latent structure of a pressurized-water-reactor assembly and generates candidate layouts in a single forward pass. Notably, the DPO-aligned model exhibits emergent design-space expansion: despite being trained exclusively on configurations with a fixed number of gadolinium burnable absorber (Gd) rods, it autonomously adjusts Gd inventory to satisfy strict power-peaking constraints. The model also discovers high-performing asymmetric configurations that challenge conventional symmetric loading heuristics, accessing design regimes inaccessible to conventional search methods and demonstrating that language models can internalize causal physical relationships and transcend human-imposed design constraints.
