Explainable fetal ultrasound quality assessment with progressive concept bottleneck models

Manxi Lin; Aasa Feragen; Kamil Mikolaj; Zahra Bashir; Martin Grønnebæk Tolsgaard; Anders Nymark Christensen

Explainable fetal ultrasound quality assessment with progressive concept bottleneck models

Manxi Lin, Aasa Feragen, Kamil Mikolaj, Zahra Bashir, Martin Grønnebæk Tolsgaard, Anders Nymark Christensen

TL;DR

The paper tackles the challenge of explainable fetal ultrasound quality assessment, where accurate identification of standard planes is essential yet difficult due to artifacts. It introduces Progressive Concept Bottleneck Models (P-CBMs) that marshal a three-stage pipeline—observer predicting segmentation concepts ($x \xrightarrow{g} s$), perceiver deriving property concepts ($s \xrightarrow{l} c$), and predictor concluding the final label ($c \xrightarrow{f} y$)—to enforce predictions that hinge on human-interpretable ISUOG criteria. By grounding concepts in segmentation and ISUOG properties, the approach mitigates information leakage, provides faithful explanations, and demonstrates strong generalization to external datasets without fine-tuning, outperforming concept-free baselines. The work evidences both improved accuracy and robust, actionable explanations, offering clinicians real-time guidance for optimizing image acquisition and downstream biometric assessments. Overall, P-CBM advances explainable, clinically aligned AI for fetal ultrasound with potential for deployment across diverse centers and setups.

Abstract

The quality of fetal ultrasound screening scans directly influences the precision of biometric measurements. However, acquiring high-quality scans is labor-intensive and highly relies on the operator's skills. Considering the low contrastiveness and imaging artifacts that widely exist in ultrasound, even a dedicated deep-learning model can be vulnerable to learning from confounding information in the image. In this paper, we propose a holistic and explainable method for fetal ultrasound quality assessment, where we design a hierarchical concept bottleneck model by introducing human-readable ``concepts" into the task and imitating the sequential expert decision-making process. This hierarchical information flow forces the model to learn concepts from semantically meaningful areas: The model first passes through a layer of visual, segmentation-based concepts, and next a second layer of property concepts directly associated with the decision-making task. We consider the quality assessment to be in a more challenging but more realistic setting, with fine-grained image recognition. Experiments show that our model outperforms equivalent concept-free models on an in-house dataset, and shows better generalizability on two public benchmarks, one from Spain and one from Africa, without any fine-tuning.

Explainable fetal ultrasound quality assessment with progressive concept bottleneck models

TL;DR

), perceiver deriving property concepts (

), and predictor concluding the final label (

)—to enforce predictions that hinge on human-interpretable ISUOG criteria. By grounding concepts in segmentation and ISUOG properties, the approach mitigates information leakage, provides faithful explanations, and demonstrates strong generalization to external datasets without fine-tuning, outperforming concept-free baselines. The work evidences both improved accuracy and robust, actionable explanations, offering clinicians real-time guidance for optimizing image acquisition and downstream biometric assessments. Overall, P-CBM advances explainable, clinically aligned AI for fetal ultrasound with potential for deployment across diverse centers and setups.

Abstract

Paper Structure (38 sections, 2 equations, 10 figures, 9 tables)

This paper contains 38 sections, 2 equations, 10 figures, 9 tables.

Introduction
Holistic and explainable fetal US analysis.
Avoiding unintended harm from information leakage.
Case study in 3rd-trimester growth scans.
Our key contributions are as follows:
Related work
Fetal ultrasound quality assessment
Model explainability
Explaining with pixel attributions.
Explaining with property concepts.
Information leakage in concept bottleneck models
Segmentation for classification
Method
"Seeing" with an observer: Visually intuitive explanations while scanning
'Soft' and 'hard' concepts: Retaining performance while avoiding leakage.
...and 23 more sections

Figures (10)

Figure 1: Examples of an abdomen standard plane and two different definitions of the negative category.
Figure 2: Examples of head standard planes and non-standard planes.
Figure 3: The process of a specialist recognizing a femur standard plane. The concepts in the "Conceiving" stage are taken from salomon2019isuog.
Figure 4: Illustration of the network architecture of the proposed P-CBM.
Figure 5: Examples of the regions of interest associated with different property concepts. (a), (b), (c), and (d) present the regions for the femur, abdomen, head, and cervix anatomy respectively.
...and 5 more figures

Explainable fetal ultrasound quality assessment with progressive concept bottleneck models

TL;DR

Abstract

Explainable fetal ultrasound quality assessment with progressive concept bottleneck models

Authors

TL;DR

Abstract

Table of Contents

Figures (10)