Reinforcement Learning techniques for the flavor problem in particle physics
A. Giarnetti, D. Meloni
TL;DR
This review surveys how Reinforcement Learning is applied to the flavor problem in particle physics, focusing on Froggatt–Nielsen–type models that generate fermion mass hierarchies. By framing model-building as sequential decision processes, RL agents learn to select charge assignments or model ingredients that reproduce quark and lepton observables, often rediscovering known textures and uncovering new viable configurations. The highlighted work spans policy-based methods (REINFORCE) in quark FN textures, value-based learning (DQN) in lepton sectors, and autonomous design with PPO in broader theory spaces (AMBer), demonstrating substantial navigation efficiency in otherwise intractable model spaces. Notably, RL results hint at a statistical preference for Normal Ordering in neutrino masses and illustrate the potential for fully autonomous, data-constrained theory design frameworks, marking a promising direction for future beyond-Standard-Model explorations.
Abstract
This short review discusses recent applications of Reinforcement Learning (RL) techniques to the flavor problem in particle physics. Traditional approaches to fermion masses and mixing often rely on extensions of the Standard Model based on horizontal symmetries, but the vast landscape of possible models makes systematic exploration infeasible. Recent works have shown that RL can efficiently navigate this landscape by constructing models that reproduce observed quark and lepton observables. These approaches demonstrate that RL not only rediscovers models already proposed in the literature but also uncovers new, phenomenologically acceptable solutions.
