First Analysis of Local GD on Heterogeneous Data

Ahmed Khaled; Konstantin Mishchenko; Peter Richtárik

First Analysis of Local GD on Heterogeneous Data

Ahmed Khaled, Konstantin Mishchenko, Peter Richtárik

TL;DR

It is shown that in a low accuracy regime, the local gradient descent method has the same communication complexity as gradient descent.

Abstract

We provide the first convergence analysis of local gradient descent for minimizing the average of smooth and convex but otherwise arbitrary functions. Problems of this form and local gradient descent as a solution method are of importance in federated learning, where each function is based on private data stored by a user on a mobile device, and the data of different users can be arbitrarily heterogeneous. We show that in a low accuracy regime, the method has the same communication complexity as gradient descent.

First Analysis of Local GD on Heterogeneous Data

TL;DR

Abstract

First Analysis of Local GD on Heterogeneous Data

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (11)