Availability of Perfect Decomposition in Statistical Linkage Learning for Unitation-based Function Concatenations
Michal Prusik, Bartosz Frej, Michal W. Przewozniczek
TL;DR
This work develops an entropy-based, Chernoff-bounds–driven estimate for the minimal population size $s_{min}$ required to achieve a perfect SLL-based decomposition when solving concatenations of unitation-based symmetric functions. By analyzing the geometry of probability distributions on the entropy simplex and the dependency-structure induced by FIHC-driven DSM construction, the authors provide an explicit criterion linking problem structure, block size, and desired confidence $\alpha$. The theory is validated on bimodal and reverted bimodal deceptive function concatenations, showing that the predicted $s_{min}$ aligns with qualitative difficulty trends and explains why certain SLL-using optimizers struggle on specific problem classes. The results offer a practical hardness measure for SLL and point to extensions for non-symmetric, overlapping, or noisier problems.
Abstract
Statistical Linkage Learning (SLL) is a part of many state-of-the-art optimizers. The purpose of SLL is to discover variable interdependencies. It has been shown that the effectiveness of SLL-using optimizers is highly dependent on the quality of SLL-based problem decomposition. Thus, understanding what kind of problems are hard or easy to decompose by SLL is important for practice. In this work, we analytically estimate the size of a population sufficient for obtaining a perfect decomposition in case of concatenations of certain unitation-based functions. The experimental study confirms the accuracy of the proposed estimate. Finally, using the proposed estimate, we identify those problem types that may be considered hard for SLL-using optimizers.
