M-TabNet: A Multi-Encoder Transformer Model for Predicting Neonatal Birth Weight from Multimodal Data

Muhammad Mursil; Hatem A. Rashwan; Luis Santos-Calderon; Pere Cavalle-Busquets; Michelle M. Murphy; Domenec Puig

M-TabNet: A Multi-Encoder Transformer Model for Predicting Neonatal Birth Weight from Multimodal Data

Muhammad Mursil, Hatem A. Rashwan, Luis Santos-Calderon, Pere Cavalle-Busquets, Michelle M. Murphy, Domenec Puig

TL;DR

Birth weight (BW) prediction in early gestation is challenged by ultrasound limitations. The authors introduce M-TabNet, a transformer-based multi-encoder that fuses physiological, nutritional, lifestyle, and genetic maternal data to predict BW before 12 weeks. On the Reus–Tarragona cohort, M-TabNet achieves $MAE = 122$ g and $R^2 = 0.94$, with high LBW classification performance ($\text{sensitivity}=97.55\%$, $\text{specificity}=94.48\%$); generalization to the IEEE dataset yields $MAE = 105$ g and $R^2 = 0.95$. SHAP and feature-importance analyses highlight maternal age, tobacco exposure, and vitamin B12 status as key drivers, supporting interpretable, personalized prenatal care and early risk stratification for at-risk pregnancies.

Abstract

Birth weight (BW) is a key indicator of neonatal health, with low birth weight (LBW) linked to increased mortality and morbidity. Early prediction of BW enables timely interventions; however, current methods like ultrasonography have limitations, including reduced accuracy before 20 weeks and operator dependent variability. Existing models often neglect nutritional and genetic influences, focusing mainly on physiological and lifestyle factors. This study presents an attention-based transformer model with a multi-encoder architecture for early (less than 12 weeks of gestation) BW prediction. Our model effectively integrates diverse maternal data such as physiological, lifestyle, nutritional, and genetic, addressing limitations seen in prior attention-based models such as TabNet. The model achieves a Mean Absolute Error (MAE) of 122 grams and an R-squared value of 0.94, demonstrating high predictive accuracy and interoperability with our in-house private dataset. Independent validation confirms generalizability (MAE: 105 grams, R-squared: 0.95) with the IEEE children dataset. To enhance clinical utility, predicted BW is classified into low and normal categories, achieving a sensitivity of 97.55% and a specificity of 94.48%, facilitating early risk stratification. Model interpretability is reinforced through feature importance and SHAP analyses, highlighting significant influences of maternal age, tobacco exposure, and vitamin B12 status, with genetic factors playing a secondary role. Our results emphasize the potential of advanced deep-learning models to improve early BW prediction, offering clinicians a robust, interpretable, and personalized tool for identifying pregnancies at risk and optimizing neonatal outcomes.

M-TabNet: A Multi-Encoder Transformer Model for Predicting Neonatal Birth Weight from Multimodal Data

TL;DR

Abstract

M-TabNet: A Multi-Encoder Transformer Model for Predicting Neonatal Birth Weight from Multimodal Data

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)