Improving Wireless Federated Learning via Joint Downlink-Uplink Beamforming over Analog Transmission
Chong Zhang, Min Dong, Ben Liang, Ali Afana, Yahia Ahmed
TL;DR
This work tackles the degradation of federated learning caused by noisy wireless channels in a setting with analog downlink transmission and over-the-air uplink aggregation using a multi-antenna BS. It derives a round-trip global model update equation and upper bounds on the expected training loss after $T$ rounds, tying convergence to the joint downlink-uplink beamforming design. To optimize learning performance efficiently, it introduces JDUBF, with two variants JDUBF-R (receive-only uplink beamforming) and JDUBF-TR (joint transmit-receive uplink beamforming), solved online per round via a greedy BCD-PGD framework and aided by a closed-form initialization. Experiments on MNIST with realistic 5G-like channels show JDUBF-TR substantially improves convergence and can nearly match ideal FL under favorable conditions, underscoring the practical value of joint transceiver design for wireless FL.
Abstract
Federated learning (FL) over wireless networks using analog transmission can efficiently utilize the communication resource but is susceptible to errors caused by noisy wireless links. In this paper, assuming a multi-antenna base station, we jointly design downlink-uplink beamforming to maximize FL training convergence over time-varying wireless channels. We derive the round-trip model updating equation and use it to analyze the FL training convergence to capture the effects of downlink and uplink beamforming and the local model training on the global model update. Aiming to maximize the FL training convergence rate, we propose a low-complexity joint downlink-uplink beamforming (JDUBF) algorithm, which adopts a greedy approach to decompose the multi-round joint optimization and convert it into per-round online joint optimization problems. The per-round problem is further decomposed into three subproblems over a block coordinate descent framework, where we show that each subproblem can be efficiently solved by projected gradient descent with fast closed-form updates. An efficient initialization method that leads to a closed-form initial point is also proposed to accelerate the convergence of JDUBF. Simulation demonstrates that JDUBF substantially outperforms the conventional separate-link beamforming design.
