A Unified Framework for Foreground and Anonymization Area Segmentation in CT and MRI Data
Michal Nohel, Constantin Ulrich, Jonathan Suprijadi, Tassilo Wald, Klaus Maier-Hein
TL;DR
The paper introduces an open-source preprocessing toolkit for self-supervised learning (SSL) in 3D medical imaging that tackles two bottlenecks: compute wasted on sampling air-filled background regions, and misleading supervision from anonymized (defaced or refaced) data. It trains two nnU-Net-based networks on large, multi-modal CT and MRI datasets, one to delineate the anatomical foreground and one to delineate anonymized facial regions, both using 3D patches of $192\times192\times192$ voxels and $z$-score normalization. Foreground segmentation reaches near-perfect Dice scores ($\approx 99\%$) on in-distribution data and remains robust on external datasets, while anonymization segmentation also achieves high Dice scores across deface and reface procedures, indicating strong applicability to privacy-preserving SSL in CT and MRI. The toolkit enables faster SSL pretraining and safer supervision across diverse 3D medical imaging tasks, which could in turn benefit downstream diagnosis and analysis while maintaining data privacy.
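To illustrate how a foreground mask can steer patch sampling away from air, the following is a minimal NumPy sketch, not the toolkit's actual implementation. The function names `zscore_normalize` and `sample_foreground_patch` are hypothetical, per-volume statistics are assumed for the $z$-score, and a small $16\times16\times16$ patch stands in for the $192\times192\times192$ patches mentioned above.

```python
import numpy as np


def zscore_normalize(volume):
    """Z-score normalize a volume with its own mean and standard deviation.

    Assumption: statistics are computed per volume; the toolkit may differ.
    """
    volume = volume.astype(np.float32)
    std = float(volume.std())
    return (volume - volume.mean()) / max(std, 1e-8)


def sample_foreground_patch(volume, foreground_mask, patch_size, rng):
    """Crop a patch that contains a randomly chosen foreground voxel.

    The patch is centered on that voxel where possible and shifted
    inward when the voxel lies close to the volume border.
    """
    coords = np.argwhere(foreground_mask)
    if len(coords) == 0:
        raise ValueError("foreground mask is empty")
    center = coords[rng.integers(len(coords))]
    slices = []
    for c, p, s in zip(center, patch_size, volume.shape):
        start = int(np.clip(c - p // 2, 0, max(s - p, 0)))
        slices.append(slice(start, start + p))
    return volume[tuple(slices)]


# Toy volume: mostly "air" (zeros) with a small block of anatomy.
rng = np.random.default_rng(0)
volume = np.zeros((64, 64, 64), dtype=np.float32)
volume[20:40, 20:40, 20:40] = 1.0
mask = volume > 0  # stand-in for the predicted foreground segmentation

patch = sample_foreground_patch(zscore_normalize(volume), mask, (16, 16, 16), rng)
print(patch.shape)
```

Sampling patch centers only from foreground voxels means every training patch overlaps anatomy, which is the mechanism by which a foreground mask can reduce time spent on uninformative air regions.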
Abstract
This study presents an open-source toolkit that addresses critical challenges in preprocessing data for self-supervised learning (SSL) in 3D medical imaging, focusing on data privacy and computational efficiency. The toolkit comprises two main components: a segmentation network that delineates foreground regions to optimize data sampling and thus reduce training time, and a segmentation network that identifies anonymized regions, preventing erroneous supervision in reconstruction-based SSL methods. Experimental results demonstrate high robustness, with mean Dice scores exceeding 98.5% across all anonymization methods and surpassing 99.5% for foreground segmentation, highlighting the efficacy of the toolkit in supporting SSL applications in 3D medical imaging for both CT and MRI. The weights and code are available at https://github.com/MIC-DKFZ/Foreground-and-Anonymization-Area-Segmentation.
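For readers unfamiliar with the two quantities involved, the sketch below shows the standard Dice score, $2|A \cap B| / (|A| + |B|)$, and one possible way an anonymization mask could be used to exclude defaced voxels from a reconstruction loss. The abstract does not specify the loss, so `masked_mse` is a hypothetical illustration and not the authors' method.

```python
import numpy as np


def dice_score(pred, target):
    """Dice overlap between two binary masks, as a fraction in [0, 1]."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    denom = pred.sum() + target.sum()
    if denom == 0:
        return 1.0  # both masks empty: treated as perfect agreement
    return float(2.0 * np.logical_and(pred, target).sum() / denom)


def masked_mse(reconstruction, original, anonymized_mask):
    """Mean squared error over voxels that are NOT anonymized.

    Assumption: anonymized voxels are simply dropped from the loss so
    that the model is not trained to reproduce defaced regions.
    """
    valid = ~anonymized_mask.astype(bool)
    if not valid.any():
        return 0.0
    diff = reconstruction[valid] - original[valid]
    return float(np.mean(diff ** 2))


# Toy example: the anonymized region holds artificial values that a
# reconstruction-based objective should not be asked to reproduce.
original = np.ones((8, 8, 8), dtype=np.float32)
anonymized = np.zeros((8, 8, 8), dtype=bool)
anonymized[:, :, :2] = True
original[anonymized] = 100.0
reconstruction = np.ones((8, 8, 8), dtype=np.float32)

print(masked_mse(reconstruction, original, anonymized))
print(100.0 * dice_score(anonymized, anonymized))  # Dice in percent
```

Multiplying the Dice fraction by 100 gives the percentage scale used for the results reported above.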
