Table of Contents
Fetching ...

Automated Fetal Biometry Assessment with Deep Ensembles using Sparse-Sampling of 2D Intrapartum Ultrasound Images

Jayroop Ramesh, Valentin Bacher, Mark C. Eid, Hoda Kalabizadeh, Christian Rupprecht, Ana IL Namburete, Pak-Hei Yeung, Madeleine K. Wyburd, Nicola K. Dinsdale

TL;DR

The study tackles variability in intrapartum ultrasound-based fetal biometrics by proposing a three-stage automated pipeline: (1) standard-plane classification to identify relevant frames, (2) segmentation of the fetal head and pubic symphysis, and (3) computation of angle of progression (AoP) and head-symphysis distance (HSD) from the segmentations. It leverages sparse-frame sampling and ensemble learning to improve generalization across diverse ultrasound acquisitions, with postprocessing including largest-component retention and ellipse fitting to stabilize measurements. On unseen hold-out data, the classification ensemble CLF-5 achieves ACC $0.9452$, F1 $0.922$, AUC $0.983$, MCC $0.836$, while SEG-3 achieves DSC $0.919$, ASD $5.712$, and HD $19.735$, leading to measurement errors of $Δ_{AoP}=8.906^ ext{o}$ and $Δ_{HSD}=14.356$ pixels. The results demonstrate robust performance and potential utility in clinical risk stratification and efficient prenatal care, despite variability in US imaging conditions.

Abstract

The International Society of Ultrasound advocates Intrapartum Ultrasound (US) Imaging in Obstetrics and Gynecology (ISUOG) to monitor labour progression through changes in fetal head position. Two reliable ultrasound-derived parameters that are used to predict outcomes of instrumental vaginal delivery are the angle of progression (AoP) and head-symphysis distance (HSD). In this work, as part of the Intrapartum Ultrasounds Grand Challenge (IUGC) 2024, we propose an automated fetal biometry measurement pipeline to reduce intra- and inter-observer variability and improve measurement reliability. Our pipeline consists of three key tasks: (i) classification of standard planes (SP) from US videos, (ii) segmentation of fetal head and pubic symphysis from the detected SPs, and (iii) computation of the AoP and HSD from the segmented regions. We perform sparse sampling to mitigate class imbalances and reduce spurious correlations in task (i), and utilize ensemble-based deep learning methods for task (i) and (ii) to enhance generalizability under different US acquisition settings. Finally, to promote robustness in task iii) with respect to the structural fidelity of measurements, we retain the largest connected components and apply ellipse fitting to the segmentations. Our solution achieved ACC: 0.9452, F1: 0.9225, AUC: 0.983, MCC: 0.8361, DSC: 0.918, HD: 19.73, ASD: 5.71, $Δ_{AoP}$: 8.90 and $Δ_{HSD}$: 14.35 across an unseen hold-out set of 4 patients and 224 US frames. The results from the proposed automated pipeline can improve the understanding of labour arrest causes and guide the development of clinical risk stratification tools for efficient and effective prenatal care.

Automated Fetal Biometry Assessment with Deep Ensembles using Sparse-Sampling of 2D Intrapartum Ultrasound Images

TL;DR

The study tackles variability in intrapartum ultrasound-based fetal biometrics by proposing a three-stage automated pipeline: (1) standard-plane classification to identify relevant frames, (2) segmentation of the fetal head and pubic symphysis, and (3) computation of angle of progression (AoP) and head-symphysis distance (HSD) from the segmentations. It leverages sparse-frame sampling and ensemble learning to improve generalization across diverse ultrasound acquisitions, with postprocessing including largest-component retention and ellipse fitting to stabilize measurements. On unseen hold-out data, the classification ensemble CLF-5 achieves ACC , F1 , AUC , MCC , while SEG-3 achieves DSC , ASD , and HD , leading to measurement errors of and pixels. The results demonstrate robust performance and potential utility in clinical risk stratification and efficient prenatal care, despite variability in US imaging conditions.

Abstract

The International Society of Ultrasound advocates Intrapartum Ultrasound (US) Imaging in Obstetrics and Gynecology (ISUOG) to monitor labour progression through changes in fetal head position. Two reliable ultrasound-derived parameters that are used to predict outcomes of instrumental vaginal delivery are the angle of progression (AoP) and head-symphysis distance (HSD). In this work, as part of the Intrapartum Ultrasounds Grand Challenge (IUGC) 2024, we propose an automated fetal biometry measurement pipeline to reduce intra- and inter-observer variability and improve measurement reliability. Our pipeline consists of three key tasks: (i) classification of standard planes (SP) from US videos, (ii) segmentation of fetal head and pubic symphysis from the detected SPs, and (iii) computation of the AoP and HSD from the segmented regions. We perform sparse sampling to mitigate class imbalances and reduce spurious correlations in task (i), and utilize ensemble-based deep learning methods for task (i) and (ii) to enhance generalizability under different US acquisition settings. Finally, to promote robustness in task iii) with respect to the structural fidelity of measurements, we retain the largest connected components and apply ellipse fitting to the segmentations. Our solution achieved ACC: 0.9452, F1: 0.9225, AUC: 0.983, MCC: 0.8361, DSC: 0.918, HD: 19.73, ASD: 5.71, : 8.90 and : 14.35 across an unseen hold-out set of 4 patients and 224 US frames. The results from the proposed automated pipeline can improve the understanding of labour arrest causes and guide the development of clinical risk stratification tools for efficient and effective prenatal care.

Paper Structure

This paper contains 16 sections, 2 equations, 4 figures, 8 tables.

Figures (4)

  • Figure 1: Simplified flow chart of a batch of frames being processed by all three tasks. 'avg.' is an average operator, ensembling the predictors and $\otimes$ is the masking of the frames to obtain only the standard planes.
  • Figure 5: Overview of the training process, from spare-sampling to augmentation, followed by classification/segmentation.
  • Figure 6: Predicted SPs from Phase 1, True Positives on Row 1, True Negatives on Row 2 and a combination of False Positives and Negatives on Row 3.
  • Figure 7: Predicted segmentation masks and corresponding Dice scores.