Sequential Spatial-Temporal Network for Interpretable Automatic Ultrasonic Assessment of Fetal Head during labor

Jie Gan; Zhuonan Liang; Jianan Fan; Lisa Mcguire; Caterina Watson; Jacqueline Spurway; Jillian Clarke; Weidong Cai

Sequential Spatial-Temporal Network for Interpretable Automatic Ultrasonic Assessment of Fetal Head during labor

Jie Gan, Zhuonan Liang, Jianan Fan, Lisa Mcguire, Caterina Watson, Jacqueline Spurway, Jillian Clarke, Weidong Cai

TL;DR

The paper tackles automatic, interpretable assessment of fetal head descent during labor using intrapartum ultrasound by focusing on the ISUOG metrics $AoP$ and $HSD$. It introduces the Sequential Spatial-Temporal Network (SSTN), a first interpretable model for intrapartum ultrasound video, which sequentially identifies planes, segments anatomical structures, and detects landmarks to compute $AoP$ and $HSD$ while leveraging temporal context. Through multitask supervision and a three-stage architecture (feature enhancement, Video Swin Transformer encoder, and a ResConv-UpConv decoder), SSTN achieves state-of-the-art performance, reducing $\Delta$AoP by 18% and $\Delta$HSD by 22% compared to baselines. The approach demonstrates improved robustness and interpretability, with potential for clinical deployment in labor assessments and guidance for future research in ultrasound video analysis.

Abstract

The intrapartum ultrasound guideline established by ISUOG highlights the Angle of Progression (AoP) and Head Symphysis Distance (HSD) as pivotal metrics for assessing fetal head descent and predicting delivery outcomes. Accurate measurement of the AoP and HSD requires a structured process. This begins with identifying standardized ultrasound planes, followed by the detection of specific anatomical landmarks within the regions of the pubic symphysis and fetal head that correlate with the delivery parameters AoP and HSD. Finally, these measurements are derived based on the identified anatomical landmarks. Addressing the clinical demands and standard operation process outlined in the ISUOG guideline, we introduce the Sequential Spatial-Temporal Network (SSTN), the first interpretable model specifically designed for the video of intrapartum ultrasound analysis. The SSTN operates by first identifying ultrasound planes, then segmenting anatomical structures such as the pubic symphysis and fetal head, and finally detecting key landmarks for precise measurement of HSD and AoP. Furthermore, the cohesive framework leverages task-related information to improve accuracy and reliability. Experimental evaluations on clinical datasets demonstrate that SSTN significantly surpasses existing models, reducing the mean absolute error by 18% for AoP and 22% for HSD.

Sequential Spatial-Temporal Network for Interpretable Automatic Ultrasonic Assessment of Fetal Head during labor

TL;DR

Abstract

Sequential Spatial-Temporal Network for Interpretable Automatic Ultrasonic Assessment of Fetal Head during labor

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)