Scaling nnU-Net for CBCT Segmentation
Fabian Isensee, Yannick Kirchhoff, Lars Kraemer, Maximilian Rokuss, Constantin Ulrich, Klaus H. Maier-Hein
TL;DR
This work addresses multi-structure segmentation in dental CBCT via a tailored nnU-Net ResEnc L configuration. The authors increase patch size, deepen the network, adjust augmentation (notably disabling left-right mirroring), extend training, and apply per-class postprocessing cutoffs, achieving a test Dice of $0.9253$ and HD95 of $18.472$, with a mean rank of $4.6$ and first place in ToothFairy2. They validate gains through five-fold cross-validation and qualitative analyses, and provide open-source code for reproducibility. The approach demonstrates robust, high-precision CBCT segmentation relevant for dental diagnostics and planning, with practical implications for automated clinical workflows.
Abstract
This paper presents our approach to scaling the nnU-Net framework for multi-structure segmentation on Cone Beam Computed Tomography (CBCT) images, specifically in the scope of the ToothFairy2 Challenge. We leveraged the nnU-Net ResEnc L model, introducing key modifications to patch size, network topology, and data augmentation strategies to address the unique challenges of dental CBCT imaging. Our method achieved a mean Dice coefficient of 0.9253 and HD95 of 18.472 on the test set, securing a mean rank of 4.6 and with it the first place in the ToothFairy2 challenge. The source code is publicly available, encouraging further research and development in the field.
