Robustness of Deep Learning for Accelerated MRI: Benefits of Diverse Training Data
Kang Lin, Reinhard Heckel
TL;DR
This work addresses how training data diversity affects robustness of deep learning reconstructions for accelerated MRI under distribution shifts across anatomy, contrast, scanners, and forward-model settings. By comparing joint versus separate training across multiple architectures (U-net, ViT, VarNet) and leveraging a large, heterogeneous dataset, the authors show that diverse training yields out-of-distribution gains without sacrificing in-distribution performance, and that robustness correlates with train-test distribution similarity via CLIP-based metrics. They also reveal distributional overfitting and demonstrate pathology reconstruction and generalization from healthy data, extending findings beyond the fastMRI dataset to a broader set of 13 datasets. The practical takeaway is that one robust MRI reconstruction model trained on diverse data can outperform or match multiple specialized models, though careful training and early stopping are essential to maintain robustness while approaching real-world deployment.
Abstract
Deep learning based methods for image reconstruction are state-of-the-art for a variety of imaging tasks. However, neural networks often perform worse if the training data differs significantly from the data they are applied to. For example, a model trained for accelerated magnetic resonance imaging (MRI) on one scanner performs worse on another scanner. In this work, we investigate the impact of the training data on a model's performance and robustness for accelerated MRI. We find that models trained on the combination of various data distributions, such as those obtained from different MRI scanners and anatomies, exhibit robustness equal or superior to models trained on the best single distribution for a specific target distribution. Thus training on such diverse data tends to improve robustness. Furthermore, training on such a diverse dataset does not compromise in-distribution performance, i.e., a model trained on diverse data yields in-distribution performance at least as good as models trained on the more narrow individual distributions. Our results suggest that training a model for imaging on a variety of distributions tends to yield a more effective and robust model than maintaining separate models for individual distributions.
