Table of Contents
Fetching ...

GPU-Accelerated Genetic Programming for Symbolic Regression with Beagle Framework

Nathan Haut, Ilya Basin, Marzieh Kianinejad, Ruchika Gupta, Elijah Smith, Zachary Perrico, Wolfgang Banzhaf

Abstract

Beagle is a new software framework that enables execution of Genetic Programming tasks on the GPU. Currently available for symbolic regression, it processes individuals of the population and fitness cases for training in a way that maximizes throughput on extant GPU platforms. In this contribution, we report on the benchmarking of Beagle on the Feynman Symbolic Regression dataset and compare its performance with a fast CPU system called StackGP and the widely available PySR system under the same wall clock budget. We also report on the use of two different fitness functions, one a point-to-point error function, the other a correlation fitness function. The results demonstrate that the Beagle's GPU-aided Symbolic Regression significantly outperforms leading CPU-based frameworks.

GPU-Accelerated Genetic Programming for Symbolic Regression with Beagle Framework

Abstract

Beagle is a new software framework that enables execution of Genetic Programming tasks on the GPU. Currently available for symbolic regression, it processes individuals of the population and fitness cases for training in a way that maximizes throughput on extant GPU platforms. In this contribution, we report on the benchmarking of Beagle on the Feynman Symbolic Regression dataset and compare its performance with a fast CPU system called StackGP and the widely available PySR system under the same wall clock budget. We also report on the use of two different fitness functions, one a point-to-point error function, the other a correlation fitness function. The results demonstrate that the Beagle's GPU-aided Symbolic Regression significantly outperforms leading CPU-based frameworks.
Paper Structure (16 sections, 3 equations, 2 figures, 5 tables)

This paper contains 16 sections, 3 equations, 2 figures, 5 tables.

Figures (2)

  • Figure 1: The 10 minute benchmarking performance of the different symbolic regression methods. The darker shaded regions indicate the typical performance ($>=50\%$ solved), whereas the lighter region indicates the number of problems solved at least once.
  • Figure 2: The 30 minute benchmarking performance of the different symbolic regression methods. The darker shaded regions indicate the typical performance ($>=50\%$ solved), whereas the lighter region indicates the number of problems solved at least once.