A Carbon Tracking Model for Federated Learning: Impact of Quantization and Sparsification
Luca Barbieri, Stefano Savazzi, Sanaz Kianoush, Monica Nicoli, Luigi Serio
TL;DR
The paper introduces a real-time carbon tracking framework to quantify the energy and carbon footprint of Federated Learning across both centralized (FA) and consensus-based (CFA) architectures. It models per-round emissions and energy costs, incorporating computing, communication, and grid carbon intensity, and evaluates compression strategies—top-$t$ sparsification and probabilistic quantization (qSGD)—on MNIST and CIFAR10 tasks. Key findings show CFA outperforms FA under energy-inefficient communications (low $EE_{COM}$), while FA is preferable when communication is efficient, and that quantization/sparsification can reduce emissions by substantial margins (roughly 42-50%). The results provide practical guidelines for energy-efficient FL design, including optimal compression settings and PS placement relative to grid carbon intensity, with potential for adaptive, heterogeneous deployments in AIoT contexts.
Abstract
Federated Learning (FL) methods adopt efficient communication technologies to distribute machine learning tasks across edge devices, reducing the overhead in terms of data storage and computational complexity compared to centralized solutions. Rather than moving large data volumes from producers (sensors, machines) to energy-hungry data centers, raising environmental concerns due to resource demands, FL provides an alternative solution to mitigate the energy demands of several learning tasks while enabling new Artificial Intelligence of Things (AIoT) applications. This paper proposes a framework for real-time monitoring of the energy and carbon footprint impacts of FL systems. The carbon tracking tool is evaluated for consensus (fully decentralized) and classical FL policies. For the first time, we present a quantitative evaluation of different computationally and communication efficient FL methods from the perspectives of energy consumption and carbon equivalent emissions, suggesting also general guidelines for energy-efficient design. Results indicate that consensus-driven FL implementations should be preferred for limiting carbon emissions when the energy efficiency of the communication is low (i.e., < 25 Kbit/Joule). Besides, quantization and sparsification operations are shown to strike a balance between learning performances and energy consumption, leading to sustainable FL designs.
