Energy-Aware Heterogeneous Federated Learning via Approximate DNN Accelerators
Kilian Pfeiffer, Konstantinos Balaskas, Kostas Siozios, Jörg Henkel
TL;DR
This work addresses the challenge of heterogeneous energy resources in federated learning by introducing training-capable on-device accelerators tailored to each device’s energy budget. It combines compressed arithmetic formats and approximate computing within a hardware-aware energy model to substantially reduce training energy, up to about 4×, without sacrificing global model accuracy or fairness across devices. Unlike prior algorithmic approaches, the method designs the hardware at the device level to accommodate energy constraints while keeping full model capacity. The results demonstrate improved energy efficiency and fairness in FL, with practical implications for deploying privacy-preserving learning on resource-constrained edge devices.
Abstract
In Federated Learning (FL), devices that participate in the training usually have heterogeneous resources, i.e., energy availability. In current deployments of FL, devices that do not fulfill certain hardware requirements are often dropped from the collaborative training. However, dropping devices in FL can degrade training accuracy and introduce bias or unfairness. Several works have tackled this problem on an algorithm level, e.g., by letting constrained devices train a subset of the server neural network (NN) model. However, it has been observed that these techniques are not effective w.r.t. accuracy. Importantly, they make simplistic assumptions about devices' resources via indirect metrics such as multiply accumulate (MAC) operations or peak memory requirements. We observe that memory access costs (that are currently not considered in simplistic metrics) have a significant impact on the energy consumption. In this work, for the first time, we consider on-device accelerator design for FL with heterogeneous devices. We utilize compressed arithmetic formats and approximate computing, targeting to satisfy limited energy budgets. Using a hardware-aware energy model, we observe that, contrary to the state of the art's moderate energy reduction, our technique allows for lowering the energy requirements (by 4x) while maintaining higher accuracy.
