Universal Representation of Generalized Convex Functions and their Gradients
Moeen Nehzati
TL;DR
The paper develops a differentiable, convex-parameter-space parameterization for generalized convex functions (GCFs) and their gradients, proving universal approximation properties for both functions and gradients under mild regularity. It provides a neural-network-inspired interpretation via finitely $Y$-convex constructions with max-aggregation, along with smoothing via log-sum-exp to ensure differentiability. The framework enables converting bilevel or min-max problems in optimal transport with general costs and multi-item auctions into single-level problems solvable via first-order methods. Experiments on optimal transport and auction design validate the approach and demonstrate practical performance, with an open-source implementation released for reproducibility. This work lays a foundation for structure-aware, gradient-friendly representations of GCFs with potential for deeper finitely convex architectures in economics and optimization.
Abstract
A wide range of optimization problems can often be written in terms of generalized convex functions (GCFs). When this structure is present, it can convert certain nested bilevel objectives into single-level problems amenable to standard first-order optimization methods. We provide a new differentiable layer with a convex parameter space and show (Theorems 5.1 and 5.2) that it and its gradient are universal approximators for GCFs and their gradients. We demonstrate how this parameterization can be leveraged in practice by (i) learning optimal transport maps with general cost functions and (ii) learning optimal auctions of multiple goods. In both these cases, we show how our layer can be used to convert the existing bilevel or min-max formulations into single-level problems that can be solved efficiently with first-order methods.
