Generalized Fixed-Depth Prefix and Postfix Symbolic Regression Grammars

Edward Finkelstein

TL;DR

The paper addresses the efficiency of symbolic regression (SR) by introducing faultless fixed-depth grammars for both prefix and postfix representations, which guarantee that any expression at a specified complexity can be generated. It implements five SR algorithms (Random Search, Monte Carlo Tree Search, Particle Swarm Optimization, Genetic Programming, and Simulated Annealing) within a common C++/Eigen framework and benchmarks them on Hemberg and AI Feynman expressions. A key finding is that the average number of nodes per layer in the ground-truth expression strongly predicts whether prefix or postfix notation performs better, and a decision tree using this feature achieves notable predictive accuracy. The work offers a practical path to more efficient SR by constraining the search to fixed-depth spaces, and it suggests integrating the grammars into existing SR toolchains to accelerate discovery across disciplines.

Abstract

We develop faultless, fixed-depth, string-based prefix and postfix symbolic regression grammars, capable of producing any expression from a set of operands, unary operators and/or binary operators. Using these grammars, we outline simplified forms of five popular heuristic search strategies: Brute Force Search, Monte Carlo Tree Search, Particle Swarm Optimization, Genetic Programming, and Simulated Annealing. For each algorithm, implemented entirely within a common C++/Eigen framework, we compare the relative performance of prefix vs. postfix on ten ground-truth expressions. Our experiments show a comparatively strong correlation between the average number of nodes per layer of the ground-truth expression tree and the relative performance of prefix vs. postfix. The fixed-depth grammars developed herein can enhance scientific discovery by increasing the efficiency of symbolic regression, enabling faster identification of accurate mathematical models across various disciplines.
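
To make the "average number of nodes per layer" feature concrete, here is a minimal C++ sketch that computes it for an expression written as prefix tokens. It rests on assumptions not stated in the abstract: the feature is taken to be the total node count divided by the number of layers, with the root as the first layer, and the paper's exact definition may differ. The operator tables `kUnary` and `kBinary` and all function names are hypothetical, chosen only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <set>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical operator tables (illustration only); any other token is an operand.
static const std::set<std::string> kUnary = {"sin", "cos", "exp", "log", "sqrt"};
static const std::set<std::string> kBinary = {"+", "-", "*", "/", "^"};

// Number of children a token takes in the expression tree.
int arity(const std::string& tok) {
    if (kBinary.count(tok)) return 2;
    if (kUnary.count(tok)) return 1;
    return 0;
}

// Consume one subtree starting at tokens[pos], counting each node in its layer.
void countLayers(const std::vector<std::string>& tokens, std::size_t& pos,
                 std::size_t layer, std::vector<std::size_t>& perLayer) {
    if (pos >= tokens.size()) {
        throw std::invalid_argument("incomplete prefix expression");
    }
    const int n = arity(tokens[pos++]);
    if (perLayer.size() <= layer) perLayer.resize(layer + 1, 0);
    ++perLayer[layer];
    for (int i = 0; i < n; ++i) {
        countLayers(tokens, pos, layer + 1, perLayer);
    }
}

// Node count of every layer, root layer first.
std::vector<std::size_t> nodesPerLayer(const std::vector<std::string>& prefix) {
    std::vector<std::size_t> perLayer;
    std::size_t pos = 0;
    countLayers(prefix, pos, 0, perLayer);
    if (pos != prefix.size()) {
        throw std::invalid_argument("trailing tokens after a complete expression");
    }
    assert(!perLayer.empty());  // a valid expression has at least a root layer
    return perLayer;
}

// Assumed definition: total nodes divided by number of layers.
double averageNodesPerLayer(const std::vector<std::string>& prefix) {
    const std::vector<std::size_t> perLayer = nodesPerLayer(prefix);
    return static_cast<double>(prefix.size()) /
           static_cast<double>(perLayer.size());
}
```

For the prefix tokens `+ * x y cos x` (that is, x*y + cos(x)), the layers hold 1, 2, and 3 nodes, so under the assumed definition the average is 6 / 3 = 2.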
