Moving higher-order Taylor approximations method for smooth constrained minimization problems
Yassine Nabou, Ion Necoara
TL;DR
The paper develops Moving Taylor Approximation (MTA), a higher-order method for composite smooth minimization with smooth functional constraints, by replacing the objective and constraints with higher-order Taylor models plus regularization. It proves global convergence to KKT points in the nonconvex case and provides convergence rates under the KL property, with sublinear to linear behavior depending on the KL exponent; in the convex and uniformly convex cases, it yields sublinear and (potentially) linear or superlinear rates in function values. The authors also show that subproblems are implementable (especially for p,q ≤ 2) via convex optimization techniques and provide an adaptive variant that eliminates the need for Lipschitz constants. Numerical experiments illustrate that higher-order MTA variants outperform first-order SCP/MBA approaches, highlighting improved efficiency and faster residual reduction. Overall, MTA offers a scalable, higher-order framework for smooth constrained optimization with strong theoretical guarantees and practical solvability.
Abstract
In this paper we develop a higher-order method for solving composite (non)convex minimization problems with smooth (non)convex functional constraints. At each iteration our method approximates the smooth part of the objective function and of the constraints by higher-order Taylor approximations, leading to a moving Taylor approximation method (MTA). We present convergence guarantees for MTA algorithm for both, nonconvex and convex problems. In particular, when the objective and the constraints are nonconvex functions, we prove that the sequence generated by MTA algorithm converges globally to a KKT point. Moreover, we derive convergence rates in the iterates when the problem data satisfy the Kurdyka-Lojasiewicz (KL) property. Further, when the objective function is (uniformly) convex and the constraints are also convex, we provide (linear/superlinear) sublinear convergence rates for our algorithm. Finally, we present an efficient implementation of the proposed algorithm and compare it with existing methods from the literature.
