FUNAvg: Federated Uncertainty Weighted Averaging for Datasets with Diverse Labels
Malte Tölle, Fernando Navarro, Sebastian Eble, Ivo Wolf, Bjoern Menze, Sandy Engelhardt
TL;DR
The paper tackles learning from partially annotated, privacy-preserving medical imaging datasets by federating a shared backbone while giving each site its own segmentation head. It introduces FUNAvg, which enforces uncertainty-weighted averaging of head predictions, leveraging MC dropout-derived uncertainty to reveal and utilize unannotated structures, thus improving predictions for underrepresented labels. The approach achieves Dice scores comparable to dataset-specific models and outperforms centralized baselines when incorporating uncertainty, with improved calibration. The method enables effective multi-dataset segmentation under privacy constraints and heterogeneous annotation protocols, offering practical benefits for medical image analysis and potential applicability to other domains.
Abstract
Federated learning is one popular paradigm to train a joint model in a distributed, privacy-preserving environment. But partial annotations pose an obstacle meaning that categories of labels are heterogeneous over clients. We propose to learn a joint backbone in a federated manner, while each site receives its own multi-label segmentation head. By using Bayesian techniques we observe that the different segmentation heads although only trained on the individual client's labels also learn information about the other labels not present at the respective site. This information is encoded in their predictive uncertainty. To obtain a final prediction we leverage this uncertainty and perform a weighted averaging of the ensemble of distributed segmentation heads, which allows us to segment "locally unknown" structures. With our method, which we refer to as FUNAvg, we are even on-par with the models trained and tested on the same dataset on average. The code is publicly available at https://github.com/Cardio-AI/FUNAvg.
