Validation of Various Normalization Methods for Brain Tumor Segmentation: Can Federated Learning Overcome This Heterogeneity?

Jan Fiszer; Dominika Ciupek; Maciej Malawski

Validation of Various Normalization Methods for Brain Tumor Segmentation: Can Federated Learning Overcome This Heterogeneity?

Jan Fiszer, Dominika Ciupek, Maciej Malawski

TL;DR

This study addresses privacy and data-heterogeneity challenges in brain tumor segmentation by simulating non-IID conditions through six MRI intensity normalization schemes (five normalized methods plus raw data). It compares centralized training against federated learning variants (FedAvg and FedBN) and finds that FL can achieve near-parity with a centralized model, demonstrated by a 3D Dice score of $92\%$ on held-out data. Nyul normalization emerges as particularly problematic, while Z-score normalization provides broad cross-dataset compatibility; the results support FL as a practical, privacy-preserving approach for multi-site medical imaging tasks. The work also offers actionable guidelines on preprocessing choices and FL deployment, contributing to scalable, collaborative brain tumor segmentation with minimal data sharing.

Abstract

Deep learning (DL) has been increasingly applied in medical imaging, however, it requires large amounts of data, which raises many challenges related to data privacy, storage, and transfer. Federated learning (FL) is a training paradigm that overcomes these issues, though its effectiveness may be reduced when dealing with non-independent and identically distributed (non-IID) data. This study simulates non-IID conditions by applying different MRI intensity normalization techniques to separate data subsets, reflecting a common cause of heterogeneity. These subsets are then used for training and testing models for brain tumor segmentation. The findings provide insights into the influence of the MRI intensity normalization methods on segmentation models, both training and inference. Notably, the FL methods demonstrated resilience to inconsistently normalized data across clients, achieving the 3D Dice score of 92%, which is comparable to a centralized model (trained using all data). These results indicate that FL is a solution to effectively train high-performing models without violating data privacy, a crucial concern in medical applications. The code is available at: https://github.com/SanoScience/fl-varying-normalization.

Validation of Various Normalization Methods for Brain Tumor Segmentation: Can Federated Learning Overcome This Heterogeneity?

TL;DR

Abstract

Validation of Various Normalization Methods for Brain Tumor Segmentation: Can Federated Learning Overcome This Heterogeneity?

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)