Optimization of Sums of Bivariate Functions: An Introduction to Relaxation-Based Methods for the Case of Finite Domains
Nils Müller
TL;DR
This work introduces relaxation-based optimization for functions that decompose as a sum of bivariate terms on finite domains, i.e., $F(x)=\sum_{(i,j)\in\mathcal{E}} f_{i,j}(x_i,x_j)$. By replacing the search domain with probability measures and studying tree- and star-structured relaxations, it derives tractable formulations via linear programming, dynamic programming on trees, and entropy-regularized duals, together with dual-to-primal recovery procedures. The paper proves NP-hardness for general graphs while showing exact recoverability and efficient solutions for tree-structured cases, and it develops several algorithms (coordinate descent, dual LP, block-coordinate ascent) that leverage these relaxations. Empirical results on random sums, vertex coloring, and signal reconstruction illustrate when relaxation-based methods excel and where they struggle, highlighting the trade-offs between primal quality and dual convergence across problem classes. Overall, the framework offers a principled pathway to tractable global optimization for sums of bivariates with broad relevance to inverse problems and graphical-model-like settings, and it lays groundwork for future extensions to more general graph structures and regularized objectives.
Abstract
We study the optimization of functions with $n>2$ arguments that have a representation as a sum of several functions that have only $2$ of the $n$ arguments each, termed sums of bivariates, on finite domains. The complexity of optimizing sums of bivariates is shown to be NP-equivalent and it is shown that there exists free lunch in the optimization of sums of bivariates. Based on measure-valued extensions of the objective function, so-called relaxations, $\ell^2$-approximation, and entropy-regularization, we derive several tractable problem formulations solvable with linear programming, coordinate ascent as well as with closed-form solutions. The limits of applying tractable versions of such relaxations to sums of bivariates are investigated using general results for reconstructing measures from their bivariate marginals. Experiments in which the derived algorithms are applied to random functions, vertex coloring, and signal reconstruction problems provide insights into qualitatively different function classes that can be modeled as sums of bivariates.
