Closing the Generalization Gap in Parameter-efficient Federated Edge Learning
Xinnong Du, Zhonghao Lyu, Xiaowen Cao, Chunyang Wen, Shuguang Cui, Jie Xu
TL;DR
This paper tackles the generalization gap in parameter-efficient FEEL under non-IID data and resource constraints. It introduces an information-theoretic generalization bound and embeds it into a convergence analysis to guide a joint optimization of client participation, pruning, and wireless/computational resources via alternating optimization. The proposed framework yields a suboptimal yet effective solution that improves test accuracy while honoring energy and delay limits, as demonstrated on MNIST and CIFAR-10 with Dirichlet-based non-IID partitions. The work highlights the value of integrating generalization-aware analysis with system-level optimization to enhance robustness and efficiency of edge AI deployments.
Abstract
Federated edge learning (FEEL) provides a promising foundation for edge artificial intelligence (AI) by enabling collaborative model training while preserving data privacy. However, limited and heterogeneous local datasets, as well as resource-constrained deployment, severely degrade both model generalization and resource utilization, leading to a compromised learning performance. Therefore, we propose a parameter-efficient FEEL framework that jointly leverages model pruning and client selection to tackle such challenges. First, we derive an information-theoretic generalization statement that characterizes the discrepancy between training and testing function losses and embed it into the convergence analysis. It reveals that a larger local generalization statement can undermine the global convergence. Then, we formulate a generalization-aware average squared gradient norm bound minimization problem, by jointly optimizing the pruning ratios, client selection, and communication-computation resources under energy and delay constraints. Despite its non-convexity, the resulting mixed-integer problem is efficiently solved via an alternating optimization algorithm. Extensive experiments demonstrate that the proposed design achieves superior learning performance than state-of-the-art baselines, validating the effectiveness of coupling generalization-aware analysis with system-level optimization for efficient FEEL.
