Input layer regularization and automated regularization hyperparameter tuning for myelin water estimation using deep learning
Mirage Modi, Shashank Sule, Jonathan Palumbo, Michael Rozowski, Mustapha Bouhrara, Wojciech Czaja, Richard G. Spencer
TL;DR
This work tackles the ill-posed problem of estimating the myelin water fraction (MWF) from biexponential MR relaxometry signals by integrating traditional regularization with deep learning through Input Layer Regularization (ILR). The authors develop two pipelines—one using neural network-based lambda selection ($ ext{lambda}_{ ext{NN}}$) and one using generalized cross validation ($ ext{lambda}_{ ext{GCV}}$)—and augment the input to a parameter-estimation network with a regularized signal derived from the current lambda estimate, forming the augmented input $ extbf{x}=[ extbf{s}, extbf{G}( extbf{p}^*_{ ext{lambda}}( extbf{s}))]$. They extend ILR to the three-parameter biexponential model and demonstrate improved accuracy in estimating $c_1$ (MWF fraction) on synthetic data and in vivo brain data, with GCV offering robustness at low SNR and the NN offering strong performance at higher SNR. The study shows that hybrid classical-ML approaches can outperform purely data-driven methods in medical imaging tasks and provides a practical framework for automated regularization parameter tuning in MR relaxometry.
Abstract
We propose a novel deep learning method which combines classical regularization with data augmentation for estimating myelin water fraction (MWF) in the brain via biexponential analysis. Our aim is to design an accurate deep learning technique for analysis of signals arising in magnetic resonance relaxometry. In particular, we study the biexponential model, one of the signal models used for MWF estimation. We greatly extend our previous work on \emph{input layer regularization (ILR)} in several ways. We now incorporate optimal regularization parameter selection via a dedicated neural network or generalized cross validation (GCV) on a signal-by-signal, or pixel-by-pixel, basis to form the augmented input signal, and now incorporate estimation of MWF, rather than just exponential time constants, into the analysis. On synthetically generated data, our proposed deep learning architecture outperformed both classical methods and a conventional multi-layer perceptron. On in vivo brain data, our architecture again outperformed other comparison methods, with GCV proving to be somewhat superior to a NN for regularization parameter selection. Thus, ILR improves estimation of MWF within the biexponential model. In addition, classical methods such as GCV may be combined with deep learning to optimize MWF imaging in the human brain.
