Learning and Testing Convex Functions
Renato Ferreira Pinto, Cassandra Marcussen, Elchanan Mossel, Shivam Nadimpalli
TL;DR
The paper addresses learning and testing of real-valued convex functions on Gaussian space under Lipschitz regularity, framing the problems in the $L^2(\gamma)$ metric to the convex Lipschitz class ${\mathcal C}(L)$. It introduces an agnostic proper learner with sample complexity $n^{O(L^2/\varepsilon^2)}$ and proves a CSQ lower bound of $n^{\mathrm{poly}(L/\varepsilon)}$, as well as a tolerant two-sided tester with the same sample bounds and a one-sided tester with exponential-in-$n$ samples, leveraging Hermite analysis and the Ornstein–Uhlenbeck operator. The approach combines low-degree (polynomial) approximations with convex regression to obtain a proper learner, and uses an empirical convex envelope to realize one-sided testing, including a novel box-restriction technique to transfer Gaussian-space results to finite domains. The results illuminate the computational and statistical landscape for real-valued convexity in high dimensions, offering polynomial-in-$n$ performance under Lipschitzness and outlining clear directions for further tightening lower bounds and runtime gaps. Overall, the work advances the understanding of convex function learning and testing in continuous, high-dimensional settings and establishes a principled framework that connects approximation theory, Hermite analysis, and convex geometry.
Abstract
We consider the problems of \emph{learning} and \emph{testing} real-valued convex functions over Gaussian space. Despite the extensive study of function convexity across mathematics, statistics, and computer science, its learnability and testability have largely been examined only in discrete or restricted settings -- typically with respect to the Hamming distance, which is ill-suited for real-valued functions. In contrast, we study these problems in high dimensions under the standard Gaussian measure, assuming sample access to the function and a mild smoothness condition, namely Lipschitzness. A smoothness assumption is natural and, in fact, necessary even in one dimension: without it, convexity cannot be inferred from finitely many samples. As our main results, we give: - Learning Convex Functions: An agnostic proper learning algorithm for Lipschitz convex functions that achieves error $\varepsilon$ using $n^{O(1/\varepsilon^2)}$ samples, together with a complementary lower bound of $n^{\mathrm{poly}(1/\varepsilon)}$ samples in the \emph{correlational statistical query (CSQ)} model. - Testing Convex Functions: A tolerant (two-sided) tester for convexity of Lipschitz functions with the same sample complexity (as a corollary of our learning result), and a one-sided tester (which never rejects convex functions) using $O(\sqrt{n}/\varepsilon)^n$ samples.
