On the Nonconvexity of Push-Forward Constraints and Its Consequences in Machine Learning
Lucas de Lara, Mathis Deronzier, Alberto González-Sanz, Virgile Foy
TL;DR
This paper tackles the problem that push-forward constraints, essential in optimal transport, generative modeling, and group fairness, are typically nonconvex, precluding universal convex formulations. It develops a detailed convexity analysis of two key sets: transport maps $\mathcal{T}(P,Q)$ and equalizing maps $\mathcal{E}(P,Q)$, showing nonconvexity in broad regimes and highlighting the mismatch between convexity in measure space and function space. The authors then illustrate the practical impact on ML tasks, arguing that convex losses cannot quantify deviation from nonconvex push-forward constraints, with concrete implications for generative modeling and fairness. Finally, they propose two directions to recover convexity: weakening/strengthening the constraint to convex surrogates, or replacing deterministic maps with random couplings (Kantorovich-style relaxations), offering a roadmap for designing tractable, if not strictly convex, learning problems under push-forward conditions.
Abstract
The push-forward operation enables one to redistribute a probability measure through a deterministic map. It plays a key role in statistics and optimization: many learning problems (notably from optimal transport, generative modeling, and algorithmic fairness) include constraints or penalties framed as push-forward conditions on the model. However, the literature lacks general theoretical insights on the (non)convexity of such constraints and its consequences on the associated learning problems. This paper aims at filling this gap. In the first part, we provide a range of sufficient and necessary conditions for the (non)convexity of two sets of functions: the maps transporting one probability measure to another and the maps inducing equal output distributions across distinct probability measures. This highlights that for most probability measures, these push-forward constraints are not convex. In the second part, we show how this result implies critical limitations on the design of convex optimization problems for learning generative models or groupwise fair predictors. This work will hopefully help researchers and practitioners have a better understanding of the critical impact of push-forward conditions onto convexity.
