MedHE: Communication-Efficient Privacy-Preserving Federated Learning with Adaptive Gradient Sparsification for Healthcare
Farjana Yesmin
TL;DR
MedHE addresses the dual challenge of privacy and communication efficiency in healthcare federated learning by co-designing adaptive gradient sparsification with CKKS homomorphic encryption. The framework introduces an error-feedback-based sparsification scheme and optimized ciphertext packing to achieve a 97.5% reduction in per-round communication while preserving model utility, demonstrated by accuracy of $89.5\% \pm 0.8\%$ across five trials and a minimal $p$-value difference ($p=0.32$) versus standard FL. Formal security analysis provides IND-CPA guarantees for CKKS under RLWE, differential privacy with $\epsilon$ bounded (e.g., $\epsilon\leq 1$ in experiments), and convergence assurances under adaptive sparsity with error compensation. Deployment analyses confirm HIPAA-aligned privacy, scalability to 100+ institutions, and practical cost reductions, making MedHE a deployment-ready approach for secure, large-scale, privacy-preserving healthcare collaboration.
Abstract
Healthcare federated learning requires strong privacy guarantees while maintaining computational efficiency across resource-constrained medical institutions. This paper presents MedHE, a novel framework combining adaptive gradient sparsification with CKKS homomorphic encryption to enable privacy-preserving collaborative learning on sensitive medical data. Our approach introduces a dynamic threshold mechanism with error compensation for top-k gradient selection, achieving 97.5 percent communication reduction while preserving model utility. We provide formal security analysis under Ring Learning with Errors assumptions and demonstrate differential privacy guarantees with epsilon less than or equal to 1.0. Statistical testing across 5 independent trials shows MedHE achieves 89.5 percent plus or minus 0.8 percent accuracy, maintaining comparable performance to standard federated learning (p=0.32) while reducing communication from 1277 MB to 32 MB per training round. Comprehensive evaluation demonstrates practical feasibility for real-world medical deployments with HIPAA compliance and scalability to 100 plus institutions.
