FedProphet: Memory-Efficient Federated Adversarial Training via Robust and Consistent Cascade Learning
Minxue Tang, Yitu Wang, Jingyang Zhang, Louis DiValentin, Aolin Ding, Amin Hass, Yiran Chen, Hai "Helen" Li
TL;DR
FedProphet tackles the challenge of memory-efficient Federated Adversarial Training by partitioning a large backbone into cascaded modules that can be trained on memory-constrained devices without model swapping. On the server, a memory-aware model partitioner and a training coordinator with Adaptive Perturbation Adjustment and Differentiated Module Assignment coordinate module allocation and perturbation sizing, while a Partial-Average Model Aggregator compiles heterogeneous updates. Theoretical results show that strong convexity regularization bounds feature perturbations and reduces gradient inconsistency, enabling robustness to transfer from modules to the full backbone. Empirically, FedProphet achieves up to 80% memory reduction and up to 10.8x training-time speedup while maintaining competitive accuracy and adversarial robustness across balanced and unbalanced device settings, outperforming existing memory-efficient FAT baselines. This framework offers practical scalability for robust, privacy-preserving learning on edge devices and suggests promising extensions to NLP and other memory-saving approaches.
Abstract
Federated Adversarial Training (FAT) can supplement robustness against adversarial examples to Federated Learning (FL), promoting a meaningful step toward trustworthy AI. However, FAT requires large models to preserve high accuracy while achieving strong robustness, incurring high memory-swapping latency when training on memory-constrained edge devices. Existing memory-efficient FL methods suffer from poor accuracy and weak robustness due to inconsistent local and global models. In this paper, we propose FedProphet, a novel FAT framework that can achieve memory efficiency, robustness, and consistency simultaneously. FedProphget reduces the memory requirement in local training while guaranteeing adversarial robustness by adversarial cascade learning with strong convexity regularization, and we show that the strong robustness also implies low inconsistency in FedProphet. We also develop a training coordinator on the server of FL, with Adaptive Perturbation Adjustment for utility-robustness balance and Differentiated Module Assignment for objective inconsistency mitigation. FedPeophet significantly outperforms other baselines under different experimental settings, maintaining the accuracy and robustness of end-to-end FAT with 80% memory reduction and up to 10.8x speedup in training time.
