Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients
Brenden K. Petersen, Mikel Landajuela, T. Nathan Mundhenk, Claudio P. Santiago, Soo K. Kim, Joanne T. Kim
TL;DR
This paper tackles the problem of symbolic regression by introducing Deep Symbolic Regression (DSR), a framework that uses a large autoregressive neural network to search the space of concise mathematical expressions and trains it with a risk-seeking policy gradient to optimize for best-case, high-quality solutions. Expressions are generated as pre-order traversals of expression trees, with in-situ constraints and optional constant optimization guiding the search, and rewards are computed via $R(\tau)=\frac{1}{1+NRMSE}$. Empirically, DSR outperforms multiple baselines, including GP-based methods and commercial tools, on the Nguyen benchmark suite, especially in exact symbolic recovery, while demonstrating robustness to noise and favorable runtimes with early stopping. The approach offers a general framework for optimizing hierarchical, variable-length objects under black-box metrics and suggests broad applicability to AutoML, program synthesis, and other domains where best-case performance is paramount.
Abstract
Discovering the underlying mathematical expressions describing a dataset is a core challenge for artificial intelligence. This is the problem of $\textit{symbolic regression}$. Despite recent advances in training neural networks to solve complex tasks, deep learning approaches to symbolic regression are underexplored. We propose a framework that leverages deep learning for symbolic regression via a simple idea: use a large model to search the space of small models. Specifically, we use a recurrent neural network to emit a distribution over tractable mathematical expressions and employ a novel risk-seeking policy gradient to train the network to generate better-fitting expressions. Our algorithm outperforms several baseline methods (including Eureqa, the gold standard for symbolic regression) in its ability to exactly recover symbolic expressions on a series of benchmark problems, both with and without added noise. More broadly, our contributions include a framework that can be applied to optimize hierarchical, variable-length objects under a black-box performance metric, with the ability to incorporate constraints in situ, and a risk-seeking policy gradient formulation that optimizes for best-case performance instead of expected performance.
