Feature weighting for data analysis via evolutionary simulation
Aris Daniilidis, Alberto Domínguez Corella, Philipp Wissgott
TL;DR
The paper analyzes a data-analysis problem where feature relevance is learned via a replicator-type dynamical system on the simplex. By normalizing the data into a matrix $\Phi\in[0,1]^{n\times m}$ and defining a combined gain $\Delta_j(\gamma)$ from dominance and balance terms, the weights $\gamma$ evolve to a unique interior equilibrium $\gamma^*$. The equilibrium has a closed-form expression, ensuring non-degenerate limiting weights and enabling Pareto-aware scalarization of multi-objective problems; an illustrative numerical example demonstrates convergence and the role of rare but valuable features. The work positions itself between evolutionary multi-objective optimization and traditional feature-weighting methods, offering analytic tractability, convergence guarantees, and an interpretation of data-driven weights through gene-organism dynamics. Overall, this approach provides a principled, convergence-guaranteed method to learn feature weights from data, with potential applicability across data-driven decision problems and multi-objective analysis.
Abstract
We analyze an algorithm for assigning weights prior to scalarization in discrete multi-objective problems arising from data analysis. The algorithm evolves the weights (the relevance of features) by a replicator-type dynamic on the standard simplex, with update indices computed from a normalized data matrix. We prove that the resulting sequence converges globally to a unique interior equilibrium, yielding non-degenerate limiting weights. The method, originally inspired by evolutionary game theory, differs from standard weighting schemes in that it is analytically tractable with provable convergence.
