CAFE: Carbon-Aware Federated Learning in Geographically Distributed Data Centers

Jieming Bian; Lei Wang; Shaolei Ren; Jie Xu

CAFE: Carbon-Aware Federated Learning in Geographically Distributed Data Centers

Jieming Bian, Lei Wang, Shaolei Ren, Jie Xu

TL;DR

This work tackles the carbon footprint challenge of training large AI models across geo-distributed data centers by formulating Carbon-Aware Federated Learning (CAFE). It combines coreset-based learning utility, a Lyapunov drift-plus-penalty online optimization, and submodular maximization to select data centers under a fixed carbon budget. The framework provides theoretical guarantees and practical algorithms (deterministic and randomized double greedy) to balance learning performance with emissions, demonstrated via simulations on real carbon-intensity data and CIFAR tasks. The results show CAFE can outperform baselines in learning accuracy while respecting environmental constraints, offering a scalable approach for green AI in distributed infrastructures.

Abstract

Training large-scale artificial intelligence (AI) models demands significant computational power and energy, leading to increased carbon footprint with potential environmental repercussions. This paper delves into the challenges of training AI models across geographically distributed (geo-distributed) data centers, emphasizing the balance between learning performance and carbon footprint. We consider Federated Learning (FL) as a solution, which prioritizes model parameter exchange over raw data, ensuring data privacy and compliance with local regulations. Given the variability in carbon intensity across regions, we propose a new framework called CAFE (short for Carbon-Aware Federated Learning) to optimize training within a fixed carbon footprint budget. Our approach incorporates coreset selection to assess learning performance, employs the Lyapunov drift-plus-penalty framework to address the unpredictability of future carbon intensity, and devises an efficient algorithm to address the combinatorial complexity of the data center selection. Through extensive simulations using real-world carbon intensity data, we demonstrate the efficacy of our algorithm, highlighting its superiority over existing methods in optimizing learning performance while minimizing environmental impact.

CAFE: Carbon-Aware Federated Learning in Geographically Distributed Data Centers

TL;DR

Abstract

Paper Structure (31 sections, 3 theorems, 17 equations, 14 figures, 1 algorithm)

This paper contains 31 sections, 3 theorems, 17 equations, 14 figures, 1 algorithm.

Introduction
Related Works
System Model
Carbon Footprint
Learning Performance
Problem Formulation
Offline Benchmark
Online Data Center Selection
Methodology
Per-Slot Problem
Performance Analysis
Simulation Results
Setup
Data Centers
Simulated Learning Tasks
...and 16 more sections

Key Result

Proposition 1

With the same selection decision $\boldsymbol{a^t}$, the absolute difference between the utility value based on different model parameters $w_1$ and $w_2$ is bounded as follows:

Figures (14)

Figure 1: The size of current AI models is experiencing exponential growth.
Figure 2: The carbon intensity varies across different times and data centers.
Figure 3: The overview of CAFE.
Figure 4: Impact of per-slot algorithms.
Figure 5: Performance Comparison on CIFAR-10.
...and 9 more figures

Theorems & Definitions (4)

Definition 1
Proposition 1
Theorem 1
Theorem 2

CAFE: Carbon-Aware Federated Learning in Geographically Distributed Data Centers

TL;DR

Abstract

CAFE: Carbon-Aware Federated Learning in Geographically Distributed Data Centers

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (14)

Theorems & Definitions (4)