A Self-Supervised Framework for Improved Generalisability in Ultrasound B-mode Image Segmentation

Edward Ellis; Andrew Bulpitt; Nasim Parsa; Michael F Byrne; Sharib Ali

A Self-Supervised Framework for Improved Generalisability in Ultrasound B-mode Image Segmentation

Edward Ellis, Andrew Bulpitt, Nasim Parsa, Michael F Byrne, Sharib Ali

TL;DR

This work tackles the challenge of generalisable US B-mode image segmentation with limited labeled data by proposing a domain-informed self-supervised framework. It introduces a Cross-Patch Jigsaw pretext task augmented by frequency-domain band-stop filtering and a learnable Relation Contrastive Loss (RCL) guided by perceptual loss. Across BUSI, BrEaST, and UDIAT datasets, the approach yields consistent gains over supervised baselines, particularly under data scarcity, and demonstrates improved generalisability to out-of-distribution data. The findings underscore the value of domain-specific SSL augmentations and metric learning for robust US segmentation, with practical implications for broader clinical deployment. The work suggests further extension to other US domains, including abdominal imaging, to broaden clinical impact.

Abstract

Ultrasound (US) imaging is clinically invaluable due to its noninvasive and safe nature. However, interpreting US images is challenging, requires significant expertise, and time, and is often prone to errors. Deep learning offers assistive solutions such as segmentation. Supervised methods rely on large, high-quality, and consistently labeled datasets, which are challenging to curate. Moreover, these methods tend to underperform on out-of-distribution data, limiting their clinical utility. Self-supervised learning (SSL) has emerged as a promising alternative, leveraging unlabeled data to enhance model performance and generalisability. We introduce a contrastive SSL approach tailored for B-mode US images, incorporating a novel Relation Contrastive Loss (RCL). RCL encourages learning of distinct features by differentiating positive and negative sample pairs through a learnable metric. Additionally, we propose spatial and frequency-based augmentation strategies for the representation learning on US images. Our approach significantly outperforms traditional supervised segmentation methods across three public breast US datasets, particularly in data-limited scenarios. Notable improvements on the Dice similarity metric include a 4% increase on 20% and 50% of the BUSI dataset, nearly 6% and 9% improvements on 20% and 50% of the BrEaST dataset, and 6.4% and 3.7% improvements on 20% and 50% of the UDIAT dataset, respectively. Furthermore, we demonstrate superior generalisability on the out-of-distribution UDIAT dataset with performance boosts of 20.6% and 13.6% compared to the supervised baseline using 20% and 50% of the BUSI and BrEaST training data, respectively. Our research highlights that domain-inspired SSL can improve US segmentation, especially under data-limited conditions.

A Self-Supervised Framework for Improved Generalisability in Ultrasound B-mode Image Segmentation

TL;DR

Abstract

A Self-Supervised Framework for Improved Generalisability in Ultrasound B-mode Image Segmentation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)