Table of Contents
Fetching ...

Self-supervised Pretraining for Cardiovascular Magnetic Resonance Cine Segmentation

Rob A. J. de Mooij, Josien P. W. Pluim, Cian M. Scannell

TL;DR

This study evaluates self-supervised pretraining (SSP) for CNN-based CMR cine segmentation, benchmarking four SSP methods—SimCLR, positional contrastive learning (PCL), DINO, and masked image modeling (MIM)—on unlabeled 4D CMR stacks and fine-tuning with varying amounts of labeled data. It finds that SSP provides gains primarily when labeled data are scarce (e.g., improving DSC from 0.82 to 0.86 with MIM for 10 subjects), while offering little to no benefit when abundant labeled data are available; the choice of SSP method is critical, with DINO performing poorly for CNNs and MIM often yielding the strongest small-data gains. The study also shows modest generalization benefits to unseen cardiac phases and vendors under SSP, but emphasizes that data augmentation plays a major role in generalization, suggesting SSP is most useful as a data-scarcity remedy rather than a universal improvement. Public code is provided to enable replication and further exploration in this domain.

Abstract

Self-supervised pretraining (SSP) has shown promising results in learning from large unlabeled datasets and, thus, could be useful for automated cardiovascular magnetic resonance (CMR) short-axis cine segmentation. However, inconsistent reports of the benefits of SSP for segmentation have made it difficult to apply SSP to CMR. Therefore, this study aimed to evaluate SSP methods for CMR cine segmentation. To this end, short-axis cine stacks of 296 subjects (90618 2D slices) were used for unlabeled pretraining with four SSP methods; SimCLR, positional contrastive learning, DINO, and masked image modeling (MIM). Subsets of varying numbers of subjects were used for supervised fine-tuning of 2D models for each SSP method, as well as to train a 2D baseline model from scratch. The fine-tuned models were compared to the baseline using the 3D Dice similarity coefficient (DSC) in a test dataset of 140 subjects. The SSP methods showed no performance gains with the largest supervised fine-tuning subset compared to the baseline (DSC = 0.89). When only 10 subjects (231 2D slices) are available for supervised training, SSP using MIM (DSC = 0.86) improves over training from scratch (DSC = 0.82). This study found that SSP is valuable for CMR cine segmentation when labeled training data is scarce, but does not aid state-of-the-art deep learning methods when ample labeled data is available. Moreover, the choice of SSP method is important. The code is publicly available at: https://github.com/q-cardIA/ssp-cmr-cine-segmentation

Self-supervised Pretraining for Cardiovascular Magnetic Resonance Cine Segmentation

TL;DR

This study evaluates self-supervised pretraining (SSP) for CNN-based CMR cine segmentation, benchmarking four SSP methods—SimCLR, positional contrastive learning (PCL), DINO, and masked image modeling (MIM)—on unlabeled 4D CMR stacks and fine-tuning with varying amounts of labeled data. It finds that SSP provides gains primarily when labeled data are scarce (e.g., improving DSC from 0.82 to 0.86 with MIM for 10 subjects), while offering little to no benefit when abundant labeled data are available; the choice of SSP method is critical, with DINO performing poorly for CNNs and MIM often yielding the strongest small-data gains. The study also shows modest generalization benefits to unseen cardiac phases and vendors under SSP, but emphasizes that data augmentation plays a major role in generalization, suggesting SSP is most useful as a data-scarcity remedy rather than a universal improvement. Public code is provided to enable replication and further exploration in this domain.

Abstract

Self-supervised pretraining (SSP) has shown promising results in learning from large unlabeled datasets and, thus, could be useful for automated cardiovascular magnetic resonance (CMR) short-axis cine segmentation. However, inconsistent reports of the benefits of SSP for segmentation have made it difficult to apply SSP to CMR. Therefore, this study aimed to evaluate SSP methods for CMR cine segmentation. To this end, short-axis cine stacks of 296 subjects (90618 2D slices) were used for unlabeled pretraining with four SSP methods; SimCLR, positional contrastive learning, DINO, and masked image modeling (MIM). Subsets of varying numbers of subjects were used for supervised fine-tuning of 2D models for each SSP method, as well as to train a 2D baseline model from scratch. The fine-tuned models were compared to the baseline using the 3D Dice similarity coefficient (DSC) in a test dataset of 140 subjects. The SSP methods showed no performance gains with the largest supervised fine-tuning subset compared to the baseline (DSC = 0.89). When only 10 subjects (231 2D slices) are available for supervised training, SSP using MIM (DSC = 0.86) improves over training from scratch (DSC = 0.82). This study found that SSP is valuable for CMR cine segmentation when labeled training data is scarce, but does not aid state-of-the-art deep learning methods when ample labeled data is available. Moreover, the choice of SSP method is important. The code is publicly available at: https://github.com/q-cardIA/ssp-cmr-cine-segmentation
Paper Structure (14 sections, 1 figure, 4 tables)

This paper contains 14 sections, 1 figure, 4 tables.

Figures (1)

  • Figure 1: Visualizations of the four SSP methods that were used.