High-Probability Analysis of Online and Federated Zero-Order Optimisation

Arya Akhavan; David Janz; El-Mahdi El-Mhamdi

High-Probability Analysis of Online and Federated Zero-Order Optimisation

Arya Akhavan, David Janz, El-Mahdi El-Mhamdi

TL;DR

This paper addresses gradient-free federated optimisation by introducing FedZero, which uses ℓ1-sphere randomisation and two-point function evaluations to build compact gradient estimators. The authors prove high-probability regret bounds for federated convex zero-order optimisation and, in the single-worker regime, for classical convex zero-order methods, leveraging novel concentration inequalities on the ℓ1-sphere and time-uniform sub-Gamma bounds. The approach hinges on smoothing the objective with a controllable bias, then bounding the second-moment and deviation terms in a high-probability regime via advanced probabilistic tools. The results demonstrate both theoretical guarantees and practical advantages in communication efficiency, with ℓ1-based randomisation offering favorable tail behaviour and potential privacy benefits, and set the stage for extensions to non-convex or heterogeneous environments.

Abstract

We study distributed learning in the context of gradient-free zero-order optimisation and introduce FedZero, a federated zero-order algorithm with sharp theoretical guarantees. Our contributions are threefold. First, in the federated convex setting, we derive high-probability guarantees for regret minimisation achieved by FedZero. Second, in the single-worker regime, corresponding to the classical zero-order framework with two-point feedback, we establish the first high-probability convergence guarantees for convex zero-order optimisation, strengthening previous results that held only in expectation. Third, to establish these guarantees, we develop novel concentration tools: (i) concentration inequalities with explicit constants for Lipschitz functions under the uniform measure on the $\ell_1$-sphere, and (ii) a time-uniform concentration inequality for squared sub-Gamma random variables. These probabilistic results underpin our high-probability guarantees and may also be of independent interest.

High-Probability Analysis of Online and Federated Zero-Order Optimisation

TL;DR

Abstract

High-Probability Analysis of Online and Federated Zero-Order Optimisation

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (50)