DALex: Lexicase-like Selection via Diverse Aggregation

Andrew Ni; Li Ding; Lee Spector

TL;DR

DALex introduces a matrix-based, lexicase-like selection method that uses randomly weighted aggregation of per-case errors to achieve substantial runtime speedups while preserving nearly identical problem-solving performance across domains. By sampling training-case weights from a distribution and applying batched matrix multiplication, DALex can recover lexicase behavior in the limit of infinite particularity pressure and smoothly interpolate to relaxed variants via the standard deviation of the weight distribution. Empirical results across program synthesis, image classification, symbolic regression, and learning classifier systems demonstrate that DALex matches or closely approximates lexicase-type performance with significant reductions in computation time, enabling larger populations or more generations under the same budget. The approach unifies diverse selection strategies under a single, scalable framework, with broad implications for evolutionary computation and potentially for deep learning and reinforcement learning contexts.

Abstract

Lexicase selection has been shown to provide advantages over other selection algorithms in several areas of evolutionary computation and machine learning. In its standard form, lexicase selection filters a population or other collection based on randomly ordered training cases that are considered one at a time. This iterated filtering process can be time-consuming, particularly in settings with large numbers of training cases. In this paper, we propose a new method that is nearly equivalent to lexicase selection in terms of the individuals that it selects, but which does so significantly more quickly. The new method, called DALex (for Diversely Aggregated Lexicase), selects the best individual with respect to a weighted sum of training case errors, where the weights are randomly sampled. This allows us to formulate the core computation required for selection as matrix multiplication instead of recursive loops of comparisons, which in turn allows us to take advantage of optimized and parallel algorithms designed for matrix multiplication for speedup. Furthermore, we show that we can interpolate between the behavior of lexicase selection and its "relaxed" variants, such as epsilon or batch lexicase selection, by adjusting a single hyperparameter, named "particularity pressure," which represents the importance granted to each individual training case. Results on program synthesis, deep learning, symbolic regression, and learning classifier systems demonstrate that DALex achieves significant speedups over lexicase selection and its relaxed variants while maintaining almost identical problem-solving performance. Under a fixed computational budget, these savings free up resources that can be directed towards increasing population size or the number of generations, enabling the potential for solving more difficult problems.
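The contrast drawn in the abstract, between filtering on one case at a time and taking a randomly weighted sum of case errors, can be made concrete with a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation. The function names `lexicase_select` and `dalex_select` are hypothetical. The weighting scheme is also an assumption: the abstract says only that the weights are randomly sampled, so the sketch draws normally distributed importance scores whose standard deviation plays the role of the particularity pressure, and applies a softmax to turn them into weights.

```python
import numpy as np


def lexicase_select(errors, rng):
    """Standard lexicase selection: filter on randomly ordered cases, one at a time.

    errors has shape (n_individuals, n_cases); lower is better.
    """
    n_individuals, n_cases = errors.shape
    candidates = np.arange(n_individuals)
    for case in rng.permutation(n_cases):
        case_errors = errors[candidates, case]
        # Keep only the candidates that are best on this case.
        candidates = candidates[case_errors == case_errors.min()]
        if len(candidates) == 1:
            break
    # Break any remaining ties uniformly at random.
    return int(rng.choice(candidates))


def dalex_select(errors, n_selections, particularity_pressure, rng):
    """Weighted-sum selection in the spirit of DALex (illustrative sketch).

    ASSUMPTION: weights are the softmax of importance scores drawn from
    N(0, particularity_pressure); the abstract does not fix this scheme.
    Returns one selected index per selection event.
    """
    n_individuals, n_cases = errors.shape
    scores = rng.normal(0.0, particularity_pressure, size=(n_selections, n_cases))
    # Numerically stable softmax over cases, one row per selection event.
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)
    # A single matrix product aggregates errors for every selection event.
    aggregated = weights @ errors.T  # shape (n_selections, n_individuals)
    return aggregated.argmin(axis=1)
```

In this sketch, a large particularity pressure makes each row of weights concentrate on a few cases, so the weighted sum is dominated by those cases in a way that resembles a lexicase ordering. A pressure of zero gives uniform weights, so selection reduces to picking the individual with the lowest mean error. All selection events share one matrix product, which is where the speedup described in the abstract would come from.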

Paper Structure (16 sections, 1 equation, 4 figures, 3 tables, 1 algorithm)

Introduction
Background and Related Work
Diversely Aggregated Lexicase Selection
Description
Intuition
Modifications
Experiments and Results
CBGP
Image Classification
SRBench
Learning Classifier Systems
Conclusion and Future Work
Acknowledgements
Code-Building GP for Program Synthesis
Program Synthesis Runtime Comparisons
...and 1 more section

Figures (4)

Figure 1: Fidelity of the DALex and plexicase approximations to lexicase selection on CBGP problems. Error bars show the bootstrapped 95% confidence intervals. The ratio of the probability of selecting the successful lineage under DALex to the corresponding probability under lexicase selection is very close to 1 in 6 out of 7 problems. The Jensen-Shannon divergence of the DALex selection probability distribution from the lexicase distribution is close to 0 in 5 out of 7 problems.
Figure 2: Performance of three selection methods on 20 downsampled problems from the PMLB repository. DALex has a similar test R$^2$ to epsilon lexicase while having a lower runtime and generating more concise models.
Figure 3: Performance of four selection methods on the led24 problem in the full data and partial data scenarios. Bootstrapped 95% confidence intervals are displayed as error bars for the accuracy and runtime plots and as a shaded region for the rulecount plot. DALex has approximately equal test accuracy to batch lexicase while having much lower runtime and generating the fewest rules.
Figure 4: Runtime comparison of three selection methods on program synthesis problems. The 95% confidence intervals are obtained by bootstrapping.
