Online Distributed Learning with Quantized Finite-Time Coordination

Nicola Bastianello; Apostolos I. Rikos; Karl H. Johansson

TL;DR

The paper tackles online distributed optimization over directed, fully decentralized networks without a fusion center. It introduces a distributed online projected gradient method that uses a finite-time quantized coordination (FTQC) protocol to approximate the consensus projection with quantized communications and accommodates stochastic gradients. A convergence analysis yields mean convergence bounds that capture the effects of quantization (step size $\Delta$), gradient inaccuracy ($\tau$), and time-varying costs ($\sigma$), with a limiting error of $(\sigma + \gamma + \alpha \tau)/(1 - \zeta)$, where $\zeta$ depends on the objective's strong convexity and smoothness. Numerical results on online logistic regression demonstrate that FTQC-DGD can achieve smaller asymptotic errors than alternatives under quantization, while revealing the trade-offs between batch size, quantization level, and online data shifts. Overall, the work provides a scalable, robust framework for privacy-preserving, bandwidth-efficient online learning in peer-to-peer networks with directed communication.
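To make the limiting error concrete, the following is a minimal sketch that assumes the mean error obeys a contraction recursion of the form $e_{k+1} \le \zeta e_k + \sigma + \gamma + \alpha \tau$ with $0 \le \zeta < 1$. This recursion is an assumption used for illustration, not a statement taken from the paper; the function names and the numeric values are hypothetical. Under that assumption, iterating the recursion approaches the fixed point $(\sigma + \gamma + \alpha \tau)/(1 - \zeta)$.

```python
def limiting_error(sigma, gamma, alpha, tau, zeta):
    """Fixed point of e <- zeta*e + sigma + gamma + alpha*tau (assumed recursion)."""
    if not 0.0 <= zeta < 1.0:
        raise ValueError("zeta must lie in [0, 1) for the recursion to contract")
    return (sigma + gamma + alpha * tau) / (1.0 - zeta)


def iterate_bound(e0, sigma, gamma, alpha, tau, zeta, steps):
    """Run the assumed recursion e_{k+1} = zeta*e_k + sigma + gamma + alpha*tau."""
    e = e0
    for _ in range(steps):
        e = zeta * e + sigma + gamma + alpha * tau
    return e


if __name__ == "__main__":
    # Hypothetical parameter values, chosen only to illustrate the formula.
    params = dict(sigma=0.1, gamma=0.05, alpha=0.1, tau=0.5, zeta=0.5)
    print(limiting_error(**params))
    print(iterate_bound(10.0, steps=200, **params))
```

The sketch shows why the bound degrades as $\zeta$ approaches 1: the same perturbation terms are divided by a smaller $1 - \zeta$.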

Abstract

In this paper we consider online distributed learning problems. Online distributed learning refers to the process of training learning models on distributed data sources. In our setting, a set of agents needs to cooperatively train a learning model from streaming data. Unlike federated learning, the proposed approach does not rely on a central server but only on peer-to-peer communications among the agents. This approach is often used in scenarios where data cannot be moved to a centralized location due to privacy, security, or cost reasons. To overcome the absence of a central server, we propose a distributed algorithm that relies on a quantized, finite-time coordination protocol to aggregate the locally trained models. Furthermore, our algorithm allows for the use of stochastic gradients during local training. Stochastic gradients are computed using a randomly sampled subset of the local training data, which makes the proposed algorithm more efficient and scalable than traditional gradient descent. We analyze the performance of the proposed algorithm in terms of the mean distance from the online solution. Finally, we present numerical results for a logistic regression task.
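The two ingredients named in the abstract, local stochastic gradients and aggregation over quantized messages, can be sketched as follows. This is a rough illustration under stated assumptions, not the paper's algorithm: the aggregation step here is an exact average of uniformly quantized local models, used as a stand-in for the finite-time coordination protocol, and all function names, the quantizer, and the label convention ($\{-1, +1\}$) are hypothetical choices.

```python
import numpy as np


def quantize(x, delta):
    """Uniform quantizer: round each entry to the nearest multiple of delta."""
    return delta * np.round(x / delta)


def logistic_stochastic_gradient(w, features, labels, batch_size, rng):
    """Mini-batch gradient of the logistic loss, with labels in {-1, +1}."""
    idx = rng.choice(len(labels), size=batch_size, replace=False)
    a, b = features[idx], labels[idx]
    margins = b * (a @ w)
    coeff = -b / (1.0 + np.exp(margins))
    return (a * coeff[:, None]).mean(axis=0)


def one_round(models, data, alpha, delta, batch_size, rng):
    """Local stochastic gradient step at each agent, then quantized averaging.

    The exact average below is a stand-in for the coordination protocol
    (assumption); it is not the paper's FTQC protocol.
    """
    local = [
        w - alpha * logistic_stochastic_gradient(w, a, b, batch_size, rng)
        for w, (a, b) in zip(models, data)
    ]
    agreed = quantize(np.mean([quantize(w, delta) for w in local], axis=0), delta)
    return [agreed.copy() for _ in models]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic data for three agents, generated only for illustration.
    data = [
        (rng.normal(size=(20, 2)), rng.choice([-1.0, 1.0], size=20))
        for _ in range(3)
    ]
    models = [np.zeros(2) for _ in range(3)]
    models = one_round(models, data, alpha=0.1, delta=0.01, batch_size=5, rng=rng)
    print(models[0])
```

In this sketch every agent ends the round with the same quantized model, which mirrors the idea of agreeing on an aggregate without a central server; a smaller `batch_size` makes the local gradients noisier, and a larger `delta` makes the agreed model coarser.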

Paper Structure (17 sections, 2 theorems, 20 equations, 2 figures, 2 tables, 2 algorithms)

Introduction
Problem Formulation
Algorithm
Challenges
Algorithm
Algorithm \ref{alg:main-algorithm}
Algorithm \ref{alg:finite-time-consensus}
Alternative $\operatorname{proj}_{\mathcal{C}}$ implementations
Average consensus
Gradient tracking
Centralized aggregation
Convergence Analysis
Numerical Results
Quantization
Stochastic gradients
...and 2 more sections

Key Result

Lemma 1

Let $\mathbf{x}_{k+1} = \mathbf{1}_N \otimes x_{k+1}$ be the output of Algorithm \ref{alg:finite-time-consensus}, as applied in Algorithm \ref{alg:main-algorithm} (line 5) to approximate $\operatorname{proj}_{\mathcal{C}}$ in a distributed fashion. Then it holds that

Figures (2)

Figure 1: Comparison of the proposed FTQC-DGD with Near-DGD and DGT, using quantized communications with $\Delta = 0.01$.
Figure 2: Tracking error of FTQC-DGD applied to an online logistic regression problem, with different quantization levels.

Theorems & Definitions (5)

Lemma 1: Coordination error
proof
Proposition 1: Convergence in mean
proof
Remark 1
