Zeroth-Order Constrained Optimization from a Control Perspective via Feedback Linearization
Runyu Zhang, Gioele Zardini, Asuman Ozdaglar, Jeff Shamma, Na Li
TL;DR
This work tackles safe derivative-free constrained optimization with unknown constraints by introducing zeroth-order feedback-linearization (ZOFL), which leverages two-point zeroth-order estimators to form Jacobian–vector products and enforces feasibility via a linearized constraint dynamics. By modeling the optimization as a control system and applying a feedback-linearization design, ZOFL achieves high-probability constraint satisfaction with exponential contraction plus controllable residuals that scale with the zeroth-order radii and step size. The authors provide both equality- and inequality-constrained extensions, a midpoint discretization variant to reduce discretization bias, and extensive numerical validations showing improved feasibility relative to baselines while maintaining competitive objective values. The results offer a principled, theory-backed route for safe, derivative-free constrained optimization in nonconvex settings, with practical implications for safe learning and control under black-box constraints.
Abstract
Safe derivative-free optimization under unknown constraints is a fundamental challenge in modern learning and control. Existing zeroth-order (ZO) methods typically still assume access to a first-order oracle of the constraint functions or restrict attention to convex settings, leaving nonconvex optimization with black-box constraints largely unexplored. We propose the zeroth-order feedback-linearization (ZOFL) algorithm for ZO constrained optimization that enforces feasibility without access to the first-order oracle of the constraint functions and applies to both equality and inequality constraints. The proposed approach relies only on noisy, sample-based gradient estimates obtained via two-point estimators, yet provably guarantees constraint satisfaction under mild regularity conditions. It adopts a control-theoretic perspective on ZO constrained optimization and leverages feedback linearization, a nonlinear control technique, to enforce feasibility. Finite-time bounds on constraint violation and asymptotic global convergence guarantees are established for the ZOFL algorithm. A midpoint discretization variant is further developed to improve feasibility without sacrificing optimality. Empirical results demonstrate that ZOFL consistently outperforms standard ZO baselines, achieving competitive objective values while maintaining feasibility.
