Bridging Local and Federated Data Normalization in Federated Learning: A Privacy-Preserving Approach
Melih Coşğun, Mert Gençtürk, Sinem Sav
TL;DR
This work tackles the challenge of data normalization in federated learning under non-IID data by introducing Federated Normalization (FedNorm), which simulates pooled normalization through privacy-preserving parameter exchange. The authors extend this idea with Privacy-Preserving Federated (PPF) normalization protocols for Z-score, MinMax, and Robust scaling, all implemented via multiparty fully homomorphic encryption (MHE) on the CKKS scheme. A key technical contribution is a novel encrypted $k$-th ranked element calculation that enables robust scaling entirely in the encrypted domain, reducing information leakage. Empirically, federated normalization consistently outperforms local normalization in non-IID settings, and the PPF protocols achieve practical runtimes with controllable precision loss, enabling secure, scalable PPFL preprocessing. Overall, the paper provides a comprehensive framework for privacy-preserving data normalization in FL, with broad applicability to both regression and classification tasks and potential extensions beyond normalization primitives.
Abstract
Data normalization is a crucial preprocessing step for enhancing model performance and training stability. In federated learning (FL), where data remains distributed across multiple parties during collaborative model training, normalization presents unique challenges due to the decentralized and often heterogeneous nature of the data. Traditional methods rely on either independent client-side processing, i.e., local normalization, or normalizing the entire dataset before distributing it to parties, i.e., pooled normalization. Local normalization can be problematic when data distributions across parties are non-IID, while the pooled normalization approach conflicts with the decentralized nature of FL. In this paper, we explore the adaptation of widely used normalization techniques to FL and define the term federated normalization. Federated normalization simulates pooled normalization by enabling the collaborative exchange of normalization parameters among parties. Thus, it achieves performance on par with pooled normalization without compromising data locality. However, sharing normalization parameters such as the mean introduces potential privacy risks, which we further mitigate through a robust privacy-preserving solution. Our contributions include: (i) We systematically evaluate the impact of various federated and local normalization techniques in heterogeneous FL scenarios, (ii) We propose a novel homomorphically encrypted $k$-th ranked element (and median) calculation tailored for the federated setting, enabling secure and efficient federated normalization, (iii) We propose privacy-preserving implementations of widely used normalization techniques for FL, leveraging multiparty fully homomorphic encryption (MHE).
