Mitigating False Predictions In Unreasonable Body Regions
Constantin Ulrich, Catherine Knobloch, Julius C. Holzschuh, Tassilo Wald, Maximilian R. Rokuss, Maximilian Zenk, Maximilian Fischer, Michael Baumgartner, Fabian Isensee, Klaus H. Maier-Hein
TL;DR
The paper addresses poor generalization of 3D CT segmentation when test data have larger Field-of-View (FOV) than training data, which leads to anatomically implausible predictions. It introduces Region Loss, a training-time penalty guided by a Body Part Regression (BPR) model that maps axial slices to standardized positions, enabling suppression of predictions in invalid body regions. The method is validated in both multi-dataset (MD) and single-dataset (SD) training regimes, within the MultiTalent/nnU-Net framework, and shows reduced false positives in spurious regions (up to 85.7% in MD) along with improved Dice scores. This approach enhances cross-FOV generalization for CT organ/tumor segmentation and offers a robust alternative to postprocessing, with potential extensions to other 3D modalities and detection tasks.
Abstract
Despite considerable strides in developing deep learning models for 3D medical image segmentation, the challenge of effectively generalizing across diverse image distributions persists. While domain generalization is acknowledged as vital for robust application in clinical settings, the challenges stemming from training with a limited Field of View (FOV) remain unaddressed. This limitation leads to false predictions when applied to body regions beyond the FOV of the training data. In response to this problem, we propose a novel loss function that penalizes predictions in implausible body regions, applicable in both single-dataset and multi-dataset training schemes. It is realized with a Body Part Regression model that generates axial slice positional scores. Through comprehensive evaluation using a test set featuring varying FOVs, our approach demonstrates remarkable improvements in generalization capabilities. It effectively mitigates false positive tumor predictions up to 85% and significantly enhances overall segmentation performance.
