Latency-aware Multimodal Federated Learning over UAV Networks
Shaba Shaon, Dinh C. Nguyen
TL;DR
The paper tackles latency in UAV-enabled Federated Multimodal Learning by formulating a joint optimization of sensing scheduling, UAV trajectory, and cross-layer resource allocation, under a non-convex loss with convergence guarantees. It adopts an attention-based fusion of multimodal embeddings at a UAV-BS architecture and proves a convergence bound that scales with rounds, UAVs, and modalities. To solve the latency minimization, the authors develop a BCD-SCA algorithm that decomposes the problem into three convexified sub-problems, enabling efficient iterative optimization. Empirical results show substantial latency reductions (up to 42.49%) and improved IID/non-IID learning performance, demonstrating the practical impact of latency-aware FML in dynamic UAV networks.
Abstract
This paper investigates federated multimodal learning (FML) assisted by unmanned aerial vehicles (UAVs) with a focus on minimizing system latency and providing convergence analysis. In this framework, UAVs are distributed throughout the network to collect data, participate in model training, and collaborate with a base station (BS) to build a global model. By utilizing multimodal sensing, the UAVs overcome the limitations of unimodal systems, enhancing model accuracy, generalization, and offering a more comprehensive understanding of the environment. The primary objective is to optimize FML system latency in UAV networks by jointly addressing UAV sensing scheduling, power control, trajectory planning, resource allocation, and BS resource management. To address the computational complexity of our latency minimization problem, we propose an efficient iterative optimization algorithm combining block coordinate descent and successive convex approximation techniques, which provides high-quality approximate solutions. We also present a theoretical convergence analysis for the UAV-assisted FML framework under a non-convex loss function. Numerical experiments demonstrate that our FML framework outperforms existing approaches in terms of system latency and model training performance under different data settings.
