Towards Energy-Aware Federated Learning via MARL: A Dual-Selection Approach for Model and Client
Jun Xia, Yi Zhang, Yiyu Shi
TL;DR
This work addresses energy-aware optimization in Federated Learning for battery-powered AIoT devices under heterogeneity. It introduces DR-FL, which maintains a layer-wise global model on the cloud and assigns subsets of this model to devices via a MARL-based dual-selection framework, enabling adaptive participation and model selection that balance accuracy, runtime, and energy use. The authors integrate QMIX-based cooperative reinforcement learning to jointly optimize layer selection and device participation, and they validate DR-FL on four datasets (CIFAR10/100, SVHN, Fashion-MNIST) with both simulation and real-wireless test-beds, showing improved accuracy and energy efficiency over state-of-the-art heterogeneous FL methods. The results demonstrate DR-FL’s scalability to large AIoT ecosystems and its robustness to non-IID data distributions, underscoring its practical impact for energy-constrained collaborative learning.
Abstract
Although Federated Learning (FL) is promising in knowledge sharing for heterogeneous Artificial Intelligence of Thing (AIoT) devices, their training performance and energy efficacy are severely restricted in practical battery-driven scenarios due to the ``wooden barrel effect'' caused by the mismatch between homogeneous model paradigms and heterogeneous device capability. As a result, due to various kinds of differences among devices, it is hard for existing FL methods to conduct training effectively in energy-constrained scenarios, such as battery constraints of devices. To tackle the above issues, we propose an energy-aware FL framework named DR-FL, which considers the energy constraints in both clients and heterogeneous deep learning models to enable energy-efficient FL. Unlike Vanilla FL, DR-FL adopts our proposed Muti-Agents Reinforcement Learning (MARL)-based dual-selection method, which allows participated devices to make contributions to the global model effectively and adaptively based on their computing capabilities and energy capacities in a MARL-based manner. Experiments conducted with various widely recognized datasets demonstrate that DR-FL has the capability to optimize the exchange of knowledge among diverse models in large-scale AIoT systems while adhering to energy limitations. Additionally, it improves the performance of each individual heterogeneous device's model.
