Near-optimal delta-convex estimation of Lipschitz functions
Gábor Balázs
TL;DR
The paper tackles nonparametric regression for Lipschitz functions under random design, introducing delta-convex fitting (DCF) to approximate Lipschitz targets via a nonlinear feature expansion of max-affine form. It combines adaptive center selection (AFPC), convex empirical risk minimization over a DC function class, and a two-stage refinement to produce a near-minimax estimator with respect to the intrinsic dimension $d_*$, without knowing the true Lipschitz constant $\lambda_*$. The authors establish PAC guarantees and a near-minimax convergence rate up to polylog factors under subgaussian covariates and noise, and they show how DCF readily adapts to convex shape-restricted regression. Empirically, DCF achieves competitive performance against both theory-grounded baselines and modern tree-based methods, while providing a tractable, scalable framework for Lipschitz-function estimation in high-dimensional settings.
Abstract
This paper presents a tractable algorithm for estimating an unknown Lipschitz function from noisy observations and establishes an upper bound on its convergence rate. The approach extends max-affine methods from convex shape-restricted regression to the more general Lipschitz setting. A key component is a nonlinear feature expansion that maps max-affine functions into a subclass of delta-convex functions, which act as universal approximators of Lipschitz functions while preserving their Lipschitz constants. Leveraging this property, the estimator attains the minimax convergence rate (up to logarithmic factors) with respect to the intrinsic dimension of the data under squared loss and subgaussian distributions in the random design setting. The algorithm integrates adaptive partitioning to capture intrinsic dimension, a penalty-based regularization mechanism that removes the need to know the true Lipschitz constant, and a two-stage optimization procedure combining a convex initialization with local refinement. The framework is also straightforward to adapt to convex shape-restricted regression. Experiments demonstrate competitive performance relative to other theoretically justified methods, including nearest-neighbor and kernel-based regressors.
