Table of Contents
Fetching ...

Local Differential Privacy for Distributed Stochastic Aggregative Optimization with Guaranteed Optimality

Ziqin Chen, Yongqiang Wang

Abstract

Distributed aggregative optimization underpins many cooperative optimization and multi-agent control systems, where each agent's objective function depends both on its local optimization variable and an aggregate of all agents' optimization variables. Existing distributed aggregative optimization approaches typically require access to accurate gradients of the objective functions, which, however, are often hard to obtain in real-world applications. For example, in machine learning, gradients are commonly contaminated by two main sources of noise: the randomness inherent in sampled data, and the additional variability introduced by mini-batch computations. In addition to the issue of relying on accurate gradients, existing distributed aggregative optimization approaches require agents to share explicit information, which could breach the privacy of participating agents. We propose an algorithm that can solve both problems with existing distributed aggregative optimization approaches: not only can the proposed algorithm guarantee mean-square convergence to an exact optimal solution when the gradients are subject to noise, it also simultaneously ensures rigorous differential privacy, with the cumulative privacy budget guaranteed to be finite even when the number of iterations tends to infinity. To the best of our knowledge, this is the first algorithm able to guarantee both accurate convergence and rigorous differential privacy in distributed aggregative optimization. Besides characterizing the convergence rates under nonconvex/convex/strongly convex conditions, we also rigorously quantify the cost of differential privacy in terms of convergence rates. Experimental results on personalized machine learning using benchmark datasets confirm the efficacy of the proposed algorithm.

Local Differential Privacy for Distributed Stochastic Aggregative Optimization with Guaranteed Optimality

Abstract

Distributed aggregative optimization underpins many cooperative optimization and multi-agent control systems, where each agent's objective function depends both on its local optimization variable and an aggregate of all agents' optimization variables. Existing distributed aggregative optimization approaches typically require access to accurate gradients of the objective functions, which, however, are often hard to obtain in real-world applications. For example, in machine learning, gradients are commonly contaminated by two main sources of noise: the randomness inherent in sampled data, and the additional variability introduced by mini-batch computations. In addition to the issue of relying on accurate gradients, existing distributed aggregative optimization approaches require agents to share explicit information, which could breach the privacy of participating agents. We propose an algorithm that can solve both problems with existing distributed aggregative optimization approaches: not only can the proposed algorithm guarantee mean-square convergence to an exact optimal solution when the gradients are subject to noise, it also simultaneously ensures rigorous differential privacy, with the cumulative privacy budget guaranteed to be finite even when the number of iterations tends to infinity. To the best of our knowledge, this is the first algorithm able to guarantee both accurate convergence and rigorous differential privacy in distributed aggregative optimization. Besides characterizing the convergence rates under nonconvex/convex/strongly convex conditions, we also rigorously quantify the cost of differential privacy in terms of convergence rates. Experimental results on personalized machine learning using benchmark datasets confirm the efficacy of the proposed algorithm.

Paper Structure

This paper contains 16 sections, 173 equations, 2 figures, 1 algorithm.

Figures (2)

  • Figure 1: Experimental results on CNN training using the "MNIST" dataset. (a) and (b) Training loss and cumulative privacy budget comparison of Algorithm \ref{['algorithm1']} with existing distributed aggregative optimization algorithms in Carnevale1 and lixiuxian2 under the same cumulative privacy budget, and existing DP approaches in huang and zhangjifeng1 for distributed optimization. (c) Training losses of Algorithm \ref{['algorithm1']} under different $\delta$ ($\delta$ is given in Corollary \ref{['values']}). (d) Training losses of Algorithm \ref{['algorithm1']} under different $\lambda$ ($\lambda$ is defined in \ref{['experimentloss']}). The error bars represent the standard deviation.
  • Figure 2: Experimental results on CNN training using the "CIFAR-10" dataset. (a) and (b) Training loss and cumulative privacy budget comparison of Algorithm \ref{['algorithm1']} with existing distributed aggregative optimization algorithms in Carnevale1 and lixiuxian2 under the same cumulative privacy budget, and existing DP approaches in huang and zhangjifeng1 for distributed optimization. (c) Training losses of Algorithm \ref{['algorithm1']} under different $\delta$ ($\delta$ is given in Corollary \ref{['values']}). (d) Training losses of Algorithm \ref{['algorithm1']} under different $\lambda$ ($\lambda$ is defined in \ref{['experimentloss']}). The error bars represent the standard deviation.

Theorems & Definitions (12)

  • proof
  • proof
  • proof
  • proof
  • proof
  • proof
  • proof
  • proof
  • proof
  • proof
  • ...and 2 more