Lexicase Selection Parameter Analysis: Varying Population Size and Test Case Redundancy with Diagnostic Metrics

Jose Guadalupe Hernandez; Anil Kumar Saini; Jason H. Moore

TL;DR

The paper investigates how "hidden" parameters, notably population size and test case redundancy, influence lexicase selection under a fixed evaluation budget. It uses two diagnostics from the DOSSIER suite, exploitation rate and contradictory objectives, plus an extension that adds redundant test cases, to quantify exploitation, specialist maintenance, and the impact of duplicate test cases. Smaller populations exploit gradients faster, while larger populations preserve more specialists; high redundancy can hamper the optimization and maintenance of specialists, even at large population sizes. Because the evaluation budget is fixed, a larger population leaves fewer generations, so population size, budget, and test cases should be chosen together to suit the problem being solved.

Abstract

Lexicase selection is a successful parent selection method in genetic programming that has outperformed other methods across multiple benchmark suites. Unlike other selection methods that require explicit parameters to function, such as tournament size in tournament selection, lexicase selection does not. However, if evolutionary parameters like population size and number of generations affect the effectiveness of a selection method, then lexicase's performance may also be impacted by these 'hidden' parameters. Here, we study how these hidden parameters affect lexicase's ability to exploit gradients and maintain specialists using diagnostic metrics. By varying the population size with a fixed evaluation budget, we show that smaller populations tend to have greater exploitation capabilities, whereas larger populations tend to maintain more specialists. We also consider the effect redundant test cases have on specialist maintenance, and find that high redundancy may hinder the ability to optimize and maintain specialists, even for larger populations. Ultimately, we highlight that population size, evaluation budget, and test cases must be carefully considered for the characteristics of the problem being solved.
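
For readers unfamiliar with the method, the following is a minimal sketch of a single lexicase selection event, not the authors' implementation. It assumes scores are maximized (as with the trait values in the diagnostics) and that `scores[i][j]` holds individual `i`'s score on test case `j`; the function name and argument layout are illustrative assumptions.

```python
import random


def lexicase_select(population, scores, rng=random):
    """Select one parent with standard lexicase selection.

    Assumptions (not taken from the paper): scores[i][j] is the score of
    population[i] on test case j, and higher scores are better.
    """
    candidates = list(range(len(population)))
    cases = list(range(len(scores[0])))
    rng.shuffle(cases)  # a fresh random test case ordering per selection event

    for case in cases:
        # Keep only the candidates that are best on the current test case.
        best = max(scores[i][case] for i in candidates)
        candidates = [i for i in candidates if scores[i][case] == best]
        if len(candidates) == 1:
            break

    # Any candidates still tied after all test cases are chosen among at random.
    return population[rng.choice(candidates)]
```

Note that the function takes no tuning parameter of its own: its behavior depends only on the population and the set of test cases, which is why population size and test case redundancy act as 'hidden' parameters.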

Paper Structure (19 sections, 7 figures, 1 table)

Lexicase Selection Analysis
Lexicase Selection
Analyzing Performance
Selection scheme diagnostics
Exploitation rate diagnostic
Contradictory objective diagnostic
Test case redundancy
Methods
Evaluation budget
Diagnostic experiments
Data tracking
Statistical analysis
Software availability
Results and Discussion
Smaller population sizes facilitate faster exploitation
...and 4 more sections

Figures (7)

Figure 1.1: Example phenotype construction for the contradictory objectives diagnostic. Note that the trait values of all genes except for the maximum value are zero. In this case, all trait values serve as test cases.
Figure 1.2: Mapping from an example genotype to phenotype. The trait values corresponding to the colored genes are duplicated and placed at the end of the existing sequence of test cases.
Figure 1.3: Results for lexicase selection with varying population sizes on the exploitation rate diagnostic. We report (a) the best performance in the population over time, (b) the best performance evolved throughout the evolutionary run, and (c) the total accumulated evaluations when a satisfactory solution was first discovered. For panel (a), we plot the average performance with surrounding boundaries from the best and worst performances across the 50 replicates for every $10^{8}$ evaluations.
Figure 1.4: Results for lexicase selection with varying population sizes on the contradictory objectives diagnostic. We report (a) the satisfactory trait coverage and (b) the activation gene coverage in the population over time, and (c) the best satisfactory trait coverage found throughout an evolutionary run. For panels (a) and (b), we plot the average coverage with surrounding boundaries from the best and worst coverage across the 50 replicates for every $10^{8}$ evaluations.
Figure 1.5: Results for contradictory objectives with 100 redundant test cases. We report (a) the satisfactory trait coverage and (b) the activation gene coverage in the population over time, and (c) the best satisfactory trait coverage found throughout an evolutionary run. For panels (a) and (b), we plot the average coverage with surrounding boundaries from the best and worst coverage across the 50 replicates for every $10^{8}$ evaluations.
...and 2 more figures
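
The constructions described in the captions of Figures 1.1 and 1.2 can be sketched as follows. This is an illustrative reading of the captions, not the authors' code: the function names, the first-occurrence tie-breaking rule, and the `duplicated_positions` and `copies` parameters are assumptions, since the captions do not say which genes are duplicated or how many times.

```python
def contradictory_objectives_phenotype(genotype):
    """Zero every trait except the one at the maximum gene (Figure 1.1).

    Assumption: ties for the maximum are broken by first occurrence.
    """
    active = max(range(len(genotype)), key=genotype.__getitem__)
    return [g if i == active else 0.0 for i, g in enumerate(genotype)]


def add_redundant_cases(traits, duplicated_positions, copies=1):
    """Append duplicates of selected traits to the test cases (Figure 1.2).

    Assumptions: duplicated_positions lists the indices of the traits to
    copy, and copies is how many times that set is appended at the end.
    """
    extra = [traits[p] for _ in range(copies) for p in duplicated_positions]
    return list(traits) + extra


phenotype = contradictory_objectives_phenotype([3.0, 9.0, 5.0])
test_cases = add_redundant_cases(phenotype, duplicated_positions=[1, 2])
print(phenotype)   # [0.0, 9.0, 0.0]
print(test_cases)  # [0.0, 9.0, 0.0, 9.0, 0.0]
```

Because lexicase selection shuffles all test cases, a duplicated trait is more likely to appear early in the ordering than a trait that appears once, which is one way redundancy could shift selection pressure among specialists.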
