Distributed client selection with multi-objective in federated learning assisted Internet of Vehicles
Narisu Cha, Long Chang
TL;DR
The paper addresses the large overhead of maintaining an active state in federated learning over IoV environments with millions of vehicles. It proposes a distributed client selection framework in which evaluation and selection are performed locally (via neighbor exchanges) using a multi-objective fuzzy evaluator that considers sample quantity, throughput, computational capability, and local dataset diversity. Key contributions include (i) a decentralized client selection mechanism that avoids collecting all participant states at a central server, (ii) a fuzzy Mamdani-based evaluator to handle four heterogeneous inputs, (iii) throughput prediction and training-time modeling to inform decision making, and (iv) extensive simulations showing the approach closely approximates centralized selection while reducing active-state overhead. The findings indicate significant reductions in communication overhead and robust performance under non-i.i.d data and mobility, enabling scalable FL in IoV with practical resource usage.
Abstract
Federated learning is an emerging distributed machine learning framework in the Internet of Vehicles (IoV). In IoV, millions of vehicles are willing to train the model to share their knowledge. Maintaining an active state means the participants must update their state to the FL server in a fixed interval and participate to next round. However, the cost by maintaining an active state is very large when there are a huge number of participating vehicles. In this paper, we proposed a distributed client selection scheme to reduce the cost of maintaining the active state for all participants. The clients with the highest evaluation are elected among the neighbours. In the evaluator, four variables are considered including sample quantity, throughput available, computational capability and the quality of the local dataset. We adopted fuzzy logic as the evaluator since the closed-form solution over four variables does not exist. Extensive simulation results show our proposal approximates the centralized client selection in terms of accuracy and can significantly reduce the communication overhead.
