Tensor Network Estimation of Distribution Algorithms
John Gardiner, Javier Lopez-Piqueres
TL;DR
This work reframes tensor-network generative models as Estimation of Distribution Algorithms (EDAs) and analyzes how their optimization performance relates to the quality of the underlying model. By comparing GEO and PROTES and by introducing explicit mutation as an exploration mechanism, the authors show that stronger generative models do not automatically yield better optimization, and that adding mutations can improve performance even when the model becomes a worse estimator of the training data distribution. Across experiments with equal-weighted portfolio optimization and benchmark problems, a low-expressivity MPS (bond dimension ~2) with Boltzmann-based selection and mutation often matches or surpasses more expressive TNs or Bayesian-network EDAs. The results advocate separating exploration from exploitation in TN-EDA design and suggest that future work should explore principled mutation strategies and generalization metrics to unlock the practical potential of tensor-network-based optimization.
Abstract
Tensor networks are a tool first employed in the context of many-body quantum physics that now have a wide range of uses across the computational sciences, from numerical methods to machine learning. Methods integrating tensor networks into evolutionary optimization algorithms have appeared in the recent literature. In essence, these methods can be understood as replacing the traditional crossover operation of a genetic algorithm with a tensor network-based generative model. We investigate these methods from the point of view that they are Estimation of Distribution Algorithms (EDAs). We find that optimization performance of these methods is not related to the power of the generative model in a straightforward way. Generative models that are better (in the sense that they better model the distribution from which their training data is drawn) do not necessarily result in better performance of the optimization algorithm they form a part of. This raises the question of how best to incorporate powerful generative models into optimization routines. In light of this we find that adding an explicit mutation operator to the output of the generative model often improves optimization performance.
