A Theoretical Analysis of Efficiency Constrained Utility-Privacy Bi-Objective Optimization in Federated Learning

Hanlin Gu; Xinyuan Zhao; Gongxi Zhu; Yuxing Han; Yan Kang; Lixin Fan; Qiang Yang

A Theoretical Analysis of Efficiency Constrained Utility-Privacy Bi-Objective Optimization in Federated Learning

Hanlin Gu, Xinyuan Zhao, Gongxi Zhu, Yuxing Han, Yan Kang, Lixin Fan, Qiang Yang

TL;DR

This work tackles the DPFL privacy-utility-efficiency trade-off by formulating a constrained bi-objective optimization over the trio $(\sigma, T, q)$ to balance privacy leakage $\epsilon_p$, utility loss $\epsilon_u$, and training efficiency $\epsilon_e$. It derives an analytical Pareto front, anchored by the key relation $k\sigma^2 T = qK$, and shows how to reduce the problem to tractable forms under various constraints. Theoretical results are complemented by extensive experiments on MNIST (LR/LeNet) and CIFAR-10 (ResNet-18), with strong alignment between theory and empirical Pareto fronts, enabling low-cost parameter design for DPFL. The findings offer a principled pathway to jointly optimize privacy, utility, and training time in real deployments, reducing search costs and informing practical DPFL configurations.

Abstract

Federated learning (FL) enables multiple clients to collaboratively learn a shared model without sharing their individual data. Concerns about utility, privacy, and training efficiency in FL have garnered significant research attention. Differential privacy has emerged as a prevalent technique in FL, safeguarding the privacy of individual user data while impacting utility and training efficiency. Within Differential Privacy Federated Learning (DPFL), previous studies have primarily focused on the utility-privacy trade-off, neglecting training efficiency, which is crucial for timely completion. Moreover, differential privacy achieves privacy by introducing controlled randomness (noise) on selected clients in each communication round. Previous work has mainly examined the impact of noise level ($σ$) and communication rounds ($T$) on the privacy-utility dynamic, overlooking other influential factors like the sample ratio ($q$, the proportion of selected clients). This paper systematically formulates an efficiency-constrained utility-privacy bi-objective optimization problem in DPFL, focusing on $σ$, $T$, and $q$. We provide a comprehensive theoretical analysis, yielding analytical solutions for the Pareto front. Extensive empirical experiments verify the validity and efficacy of our analysis, offering valuable guidance for low-cost parameter design in DPFL.

A Theoretical Analysis of Efficiency Constrained Utility-Privacy Bi-Objective Optimization in Federated Learning

TL;DR

This work tackles the DPFL privacy-utility-efficiency trade-off by formulating a constrained bi-objective optimization over the trio

to balance privacy leakage

, utility loss

, and training efficiency

. It derives an analytical Pareto front, anchored by the key relation

, and shows how to reduce the problem to tractable forms under various constraints. Theoretical results are complemented by extensive experiments on MNIST (LR/LeNet) and CIFAR-10 (ResNet-18), with strong alignment between theory and empirical Pareto fronts, enabling low-cost parameter design for DPFL. The findings offer a principled pathway to jointly optimize privacy, utility, and training time in real deployments, reducing search costs and informing practical DPFL configurations.

Abstract

) and communication rounds (

) on the privacy-utility dynamic, overlooking other influential factors like the sample ratio (

, the proportion of selected clients). This paper systematically formulates an efficiency-constrained utility-privacy bi-objective optimization problem in DPFL, focusing on

, and

. We provide a comprehensive theoretical analysis, yielding analytical solutions for the Pareto front. Extensive empirical experiments verify the validity and efficacy of our analysis, offering valuable guidance for low-cost parameter design in DPFL.

Paper Structure (12 sections, 4 theorems, 22 equations, 1 figure, 1 table, 2 algorithms)

This paper contains 12 sections, 4 theorems, 22 equations, 1 figure, 1 table, 2 algorithms.

Introduction
Related Work
Federated Learning
Differential Privacy
Multi-objective Optimization
MOO in DPFL
Setting and Threat Model
Privacy Leakage, Utility Loss, and Training Efficiency in DPFL
Constrained Bi-Objective Optimization in DPFL
Theoretical Analysis
Simplified Formulation
Pareto Optimal Solutions

Key Result

Theorem 1

There exists constant $r$ and constant $s$ so that given the sampling probability $q=\frac{B}{N}$ ($N$ is the number of training set) and the number of total rounds $T$, for any $\epsilon < r q^2 T$, the differentially private SGD algorithm abadi2016deeplearningwithDP is $(\epsilon, \delta)$-differe

Figures (1)

Figure :

Theorems & Definitions (12)

Definition 1: $(\epsilon, \delta)$-differential privacy in abadi2016deeplearningwithDP
Theorem 1: Thm. 1 of abadi2016deeplearningwithDP
Definition 2: Pareto dominance in gunantara2018review
Definition 3: Pareto optimal solution in gunantara2018review
Definition 4: Pareto set and front in gunantara2018review
Lemma 1: Adapted from Cor. 3.2.1 of zhang2022understanding
Remark 1
Definition 5
Lemma 2
Proof 1
...and 2 more

A Theoretical Analysis of Efficiency Constrained Utility-Privacy Bi-Objective Optimization in Federated Learning

TL;DR

Abstract

A Theoretical Analysis of Efficiency Constrained Utility-Privacy Bi-Objective Optimization in Federated Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (12)