Table of Contents
Fetching ...

Novel Constructions for Computation and Communication Trade-offs in Private Coded Distributed Computing

Shanuja Sasi, Onur Günlü

TL;DR

The paper tackles privacy in distributed computation by enforcing task-index privacy in a MapReduce-like CDC setting. It introduces an extended PDA framework that combines two PDAs to produce private CDC schemes, and derives two families of extended PDAs to analyze computation-communication trade-offs under privacy. The first construction preserves the same computation load as non-private CDC but scales the communication load by a factor $Q$, while the second construction yields improved trade-offs by trading some computation for reduced communication, with explicit load expressions showing the privacy penalties and gains. The work blends ideas from coded caching, PDAs, and distributed computing to provide information-theoretic privacy for task assignments without encrypting data, offering a path toward practical, privacy-aware private CDC designs albeit with scalability challenges. The results quantify how privacy constraints impact communication, and propose PDA-based strategies to mitigate overhead and achieve near-optimal private CDC performance in decentralized settings.

Abstract

Distributed computing enables scalable machine learning by distributing tasks across multiple nodes, but ensuring privacy in such systems remains a challenge. This paper introduces a novel private coded distributed computing model that integrates privacy constraints to keep task assignments hidden. By leveraging placement delivery arrays (PDAs), we design an extended PDA framework to characterize achievable computation and communication loads under privacy constraints. By constructing two classes of extended PDAs, we explore the trade-offs between computation and communication, showing that although privacy increases communication overhead, it can be significantly alleviated through optimized PDA-based coded strategies.

Novel Constructions for Computation and Communication Trade-offs in Private Coded Distributed Computing

TL;DR

The paper tackles privacy in distributed computation by enforcing task-index privacy in a MapReduce-like CDC setting. It introduces an extended PDA framework that combines two PDAs to produce private CDC schemes, and derives two families of extended PDAs to analyze computation-communication trade-offs under privacy. The first construction preserves the same computation load as non-private CDC but scales the communication load by a factor , while the second construction yields improved trade-offs by trading some computation for reduced communication, with explicit load expressions showing the privacy penalties and gains. The work blends ideas from coded caching, PDAs, and distributed computing to provide information-theoretic privacy for task assignments without encrypting data, offering a path toward practical, privacy-aware private CDC designs albeit with scalability challenges. The results quantify how privacy constraints impact communication, and propose PDA-based strategies to mitigate overhead and achieve near-optimal private CDC performance in decentralized settings.

Abstract

Distributed computing enables scalable machine learning by distributing tasks across multiple nodes, but ensuring privacy in such systems remains a challenge. This paper introduces a novel private coded distributed computing model that integrates privacy constraints to keep task assignments hidden. By leveraging placement delivery arrays (PDAs), we design an extended PDA framework to characterize achievable computation and communication loads under privacy constraints. By constructing two classes of extended PDAs, we explore the trade-offs between computation and communication, showing that although privacy increases communication overhead, it can be significantly alleviated through optimized PDA-based coded strategies.

Paper Structure

This paper contains 20 sections, 62 equations, 1 figure, 2 algorithms.

Figures (1)

  • Figure 1: Comparison of computation–communication trade-offs for non-private and private CDC schemes. Here, the number of nodes considered is $K = 3$ and output functions is $Q=3$.

Theorems & Definitions (14)

  • Definition 1: Computation Load LMA
  • Definition 2: Communication Load LMA
  • Definition 3
  • Example 1
  • Definition 4
  • Example 2
  • Example 3
  • Example 4
  • Example 5
  • Example 6
  • ...and 4 more