Piecewise Polynomial Regression of Tame Functions via Integer Programming
Gilles Bareilles, Johannes Aspman, Jiri Nemecek, Jakub Marecek
TL;DR
This work tackles accurate approximation of tame functions, definable in o-minimal structures, by piecewise polynomial functions over a domain $A$. It establishes a rigorous approximation bound that combines smooth-polynomial approximation on full-dimensional cells with a Lipschitz-based treatment of cell interfaces, yielding $||f-p||_{\infty,A} \le C_1 N^{-m} + C_2 l^{-2/(n-1)}$ for a piecewise polynomial family $\widetilde{P}_{N}^{l}(A)$. A novel mixed-integer programming formulation extends the OCT-H framework to regression with arbitrary-degree polynomials and affine-hyperplane partitions, enabling global optimization of the piecewise polynomial fit from sampled data. The authors demonstrate practical viability on tame neural networks, cone-like functions, and denoising tasks, while acknowledging scalability limitations and outlining avenues for improving optimization efficiency and extending applicability to higher dimensions. Overall, the paper provides both theoretical guarantees and a concrete optimization-based pipeline for estimating tame functions via hierarchical, piecewise-polynomial representations with potential impact across machine learning and optimization domains.
Abstract
Tame functions are a class of nonsmooth, nonconvex functions, which feature in a wide range of applications: functions encountered in the training of deep neural networks with all common activations, value functions of mixed-integer programs, or wave functions of small molecules. We consider approximating tame functions with piecewise polynomial functions. We bound the quality of approximation of a tame function by a piecewise polynomial function with a given number of segments on any full-dimensional cube. We also present the first mixed-integer programming formulation of piecewise polynomial regression. Together, these can be used to estimate tame functions. We demonstrate promising computational results.
