MultiTalent: A Multi-Dataset Approach to Medical Image Segmentation
Constantin Ulrich, Fabian Isensee, Tassilo Wald, Maximilian Zenk, Michael Baumgartner, Klaus H. Maier-Hein
TL;DR
MultiTalent tackles the fragmentation of publicly available medical imaging data by training a single foundation segmentation model across 13 partially labeled abdominal CT datasets with conflicting annotations. It introduces decoupled per-class heads with Sigmoid outputs and a dataset-adaptive loss to preserve diverse label definitions while allowing overlapping structures. The approach delivers consistent improvements over single-dataset baselines and prior multi-dataset methods, accelerates training and inference, and provides strong transfer-learning benefits, even with substantially fewer annotations. Overall, MultiTalent enables holistic, multi-dataset pre-training and practical, scalable deployment of robust segmentation models for clinical imaging.
Abstract
The medical imaging community generates a wealth of datasets, many of which are openly accessible and annotated for specific diseases and tasks such as multi-organ or lesion segmentation. Current practices continue to limit model training and supervised pre-training to one or a few similar datasets, neglecting the synergistic potential of other available annotated data. We propose MultiTalent, a method that leverages multiple CT datasets with diverse and conflicting class definitions to train a single model for a comprehensive structure segmentation. Our results demonstrate improved segmentation performance compared to previous related approaches, systematically, also compared to single dataset training using state-of-the-art methods, especially for lesion segmentation and other challenging structures. We show that MultiTalent also represents a powerful foundation model that offers a superior pre-training for various segmentation tasks compared to commonly used supervised or unsupervised pre-training baselines. Our findings offer a new direction for the medical imaging community to effectively utilize the wealth of available data for improved segmentation performance. The code and model weights will be published here: [tba]
