Deep infant brain segmentation from multi-contrast MRI
Malte Hoffmann, Lilla Zöllei, Adrian V. Dalca
TL;DR
This work tackles the challenge of robust infant brain segmentation across diverse ages and MRI protocols by introducing BabySeg, a group-input segmentation framework that uses domain randomization and a data engine to synthesize varied contrasts. The core idea is a U-Net with group convolutions that can ingest a flexible number of input scans, enabling segmentation from any combination of modalities without retraining. The authors demonstrate state-of-the-art performance across multiple age cohorts and input configurations with significantly faster runtimes, while acknowledging limitations in finer cerebellar structures and the need for initial image alignment. Overall, BabySeg offers a scalable, versatile tool for pediatric neuroimaging that can facilitate large-scale analyses in research and clinical settings.
Abstract
Segmentation of magnetic resonance images (MRI) facilitates analysis of human brain development by delineating anatomical structures. However, in infants and young children, accurate segmentation is challenging due to development and imaging constraints. Pediatric brain MRI is notoriously difficult to acquire, with inconsistent availability of imaging modalities, substantial non-head anatomy in the field of view, and frequent motion artifacts. This has led to specialized segmentation models that are often limited to specific image types or narrow age groups, or that are fragile for more variable images such as those acquired clinically. We address this method fragmentation with BabySeg, a deep learning brain segmentation framework for infants and young children that supports diverse MRI protocols, including repeat scans and image types unavailable during training. Our approach builds on recent domain randomization techniques, which synthesize training images far beyond realistic bounds to promote dataset shift invariance. We also describe a mechanism that enables models to flexibly pool and interact features from any number of input scans. We demonstrate state-of-the-art performance that matches or exceeds the accuracy of several existing methods for various age cohorts and input configurations using a single model, in a fraction of the runtime required by many existing tools.
