Mining Potentially Explanatory Patterns via Partial Solutions
GianCarlo Catalano, Alexander E. I. Brownlee, David Cairns, John McCall, Russell Ainslie
TL;DR
This work tackles the explainability gap in genetic algorithms for combinatorial optimization by introducing Partial Solutions (PSs): explicit, interpretable sub-configurations extracted from high-fitness solutions. A PS catalog, mined from a fixed reference population, acts as a probabilistic model from which new solutions can be generated via a Pick & Merge process, balancing simplicity, mean fitness, and atomicity through the $F^{\psi}$ objective. The approach yields both global and local explanations and can match or exceed GA performance on benchmark problems while enabling explainability, albeit at a computational cost associated with evaluating $F^{\psi}$. By linking explainability with search efficiency, the PS framework offers a pathway to surrogate modeling and model-based sampling in optimization, with potential extensions to full EDAs and real-world scheduling tasks.
Abstract
Genetic Algorithms have established their capability for solving many complex optimization problems. Even as good solutions are produced, the user's understanding of a problem is not necessarily improved, which can lead to a lack of confidence in the results. To mitigate this issue, explainability aims to give insight to the user by presenting them with the knowledge obtained by the algorithm. In this paper we introduce Partial Solutions in order to improve the explainability of solutions to combinatorial optimization problems. Partial Solutions represent beneficial traits found by analyzing a population, and are presented to the user for explainability, but also provide an explicit model from which new solutions can be generated. We present an algorithm that assembles a collection of Partial Solutions chosen to strike a balance between high fitness, simplicity and atomicity. Experiments with standard benchmarks show that the proposed algorithm is able to find Partial Solutions which improve explainability at reasonable computational cost without affecting search performance.
