Optimal Inflationary Potentials
Tomás Sousa, Deaglan J. Bartlett, Harry Desmond, Pedro G. Ferreira
TL;DR
This work tackles the underdetermination of inflationary potentials by introducing Exhaustive Symbolic Regression (ESR) to generate all simple single-field, slow-roll potentials from chosen operator bases. Potentials are ranked using the Minimum Description Length (MDL) criterion, balancing data fit against structural complexity, and optionally reweighted with a Katz back-off language model to reflect theoretical motivation. The analysis reveals MDL-optimal forms often involve highly nested exponential structures, with language priors shifting preferences toward more conventional, physically motivated shapes; tensor-to-scalar ratios can remain very small even for relatively simple expressions, and some results approach the sensitivity of upcoming surveys. The approach demonstrates a data-driven, principled route to extract implications for fundamental physics from cosmological data, while highlighting limitations tied to basis choice, prior definitions, and the need for reheating and stability considerations in viable models.
Abstract
Inflation is a highly favoured theory for the early Universe. It is compatible with current observations of the cosmic microwave background and large scale structure and is a driver in the quest to detect primordial gravitational waves. It is also, given the current quality of the data, highly under-determined with a large number of candidate implementations. We use a new method in symbolic regression to generate all possible simple scalar field potentials for one of two possible basis sets of operators. Treating these as single-field, slow-roll inflationary models we then score them with an information-theoretic metric ("minimum description length") that quantifies their efficiency in compressing the information in current data. We explore two possible priors on the parameter space of potentials, one related to the functions' structural complexity and one that uses a Katz back-off language model to prefer functions that may be theoretically motivated. This enables us to identify the inflaton potentials that optimally balance simplicity with accuracy at explaining current data, which may subsequently find theoretical motivation. Our exploratory study opens the door to extraction of fundamental physics directly from data, and may be augmented with more refined theoretical priors in the quest for a complete understanding of the early Universe.
