Table of Contents
Fetching ...

Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes

Tianwei Zhang, Dong Wei, Mengmeng Zhu, Shi Gu, Yefeng Zheng

TL;DR

The paper tackles limited labeled data in medical image analysis by introducing two self-supervised pretext tasks that exploit anatomy-oriented imaging planes. It presents Relative Orientation Regression and Relative Location Regression, along with a multitask SSL variant, to pretrain models on cardiac and knee MRI data. Pretraining with these tasks significantly improves transfer to semantic segmentation and diagnostic classification, especially in low-data regimes, outperforming several SSL baselines and ImageNet-pretrained models. This anatomy-aware SSL approach reduces annotation needs and enhances practical medical imaging pipelines across multiple organs and tasks.

Abstract

Self-supervised learning has emerged as a powerful tool for pretraining deep networks on unlabeled data, prior to transfer learning of target tasks with limited annotation. The relevance between the pretraining pretext and target tasks is crucial to the success of transfer learning. Various pretext tasks have been proposed to utilize properties of medical image data (e.g., three dimensionality), which are more relevant to medical image analysis than generic ones for natural images. However, previous work rarely paid attention to data with anatomy-oriented imaging planes, e.g., standard cardiac magnetic resonance imaging views. As these imaging planes are defined according to the anatomy of the imaged organ, pretext tasks effectively exploiting this information can pretrain the networks to gain knowledge on the organ of interest. In this work, we propose two complementary pretext tasks for this group of medical image data based on the spatial relationship of the imaging planes. The first is to learn the relative orientation between the imaging planes and implemented as regressing their intersecting lines. The second exploits parallel imaging planes to regress their relative slice locations within a stack. Both pretext tasks are conceptually straightforward and easy to implement, and can be combined in multitask learning for better representation learning. Thorough experiments on two anatomical structures (heart and knee) and representative target tasks (semantic segmentation and classification) demonstrate that the proposed pretext tasks are effective in pretraining deep networks for remarkably boosted performance on the target tasks, and superior to other recent approaches.

Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes

TL;DR

The paper tackles limited labeled data in medical image analysis by introducing two self-supervised pretext tasks that exploit anatomy-oriented imaging planes. It presents Relative Orientation Regression and Relative Location Regression, along with a multitask SSL variant, to pretrain models on cardiac and knee MRI data. Pretraining with these tasks significantly improves transfer to semantic segmentation and diagnostic classification, especially in low-data regimes, outperforming several SSL baselines and ImageNet-pretrained models. This anatomy-aware SSL approach reduces annotation needs and enhances practical medical imaging pipelines across multiple organs and tasks.

Abstract

Self-supervised learning has emerged as a powerful tool for pretraining deep networks on unlabeled data, prior to transfer learning of target tasks with limited annotation. The relevance between the pretraining pretext and target tasks is crucial to the success of transfer learning. Various pretext tasks have been proposed to utilize properties of medical image data (e.g., three dimensionality), which are more relevant to medical image analysis than generic ones for natural images. However, previous work rarely paid attention to data with anatomy-oriented imaging planes, e.g., standard cardiac magnetic resonance imaging views. As these imaging planes are defined according to the anatomy of the imaged organ, pretext tasks effectively exploiting this information can pretrain the networks to gain knowledge on the organ of interest. In this work, we propose two complementary pretext tasks for this group of medical image data based on the spatial relationship of the imaging planes. The first is to learn the relative orientation between the imaging planes and implemented as regressing their intersecting lines. The second exploits parallel imaging planes to regress their relative slice locations within a stack. Both pretext tasks are conceptually straightforward and easy to implement, and can be combined in multitask learning for better representation learning. Thorough experiments on two anatomical structures (heart and knee) and representative target tasks (semantic segmentation and classification) demonstrate that the proposed pretext tasks are effective in pretraining deep networks for remarkably boosted performance on the target tasks, and superior to other recent approaches.
Paper Structure (29 sections, 6 equations, 7 figures, 5 tables)

This paper contains 29 sections, 6 equations, 7 figures, 5 tables.

Figures (7)

  • Figure 1: Network architecture for SSL of the proposed pretext tasks (FC: fully connected layer). Top: relative orientation regression; bottom: relative location regression.
  • Figure 2: Standard CMR views. (a) A mid-ventricular SAX view: the straight lines are the intersecting lines with the 2C and 4C views in (b) and (c), respectively. (b)--(c) Standard 2C and 4C views: the parallel lines indicate intersecting lines with the stack of SAX views (with normalized relative locations marked), in which the white line indicates the SAX view in (a). (d) 3D visualization of the images in (a)--(c).
  • Figure 3: Left to right: four SAX CMR slices from the base (relative location$=$0.0) to the apex (relative location$=$1.0) of the LV.
  • Figure 4: Visualization of the predicted heatmaps for the pretext task of relative orientation regression on the DSBCC CMR validation set, along with the ground truth. ED: end-diastolic, ES: end-systolic. Best viewed when digitally zoomed in.
  • Figure 5: Visualization of the predicted heatmaps for the pretext task of relative orientation regression on the knee MRI validation data zbontar2018fastmri, along with the ground truth. Note that symmetric slices on different sides of the central slice are mapped to the same relative locations ($l_{s_i}$) by Eqn. (\ref{['eq:sym_loc']}). ACL: anterior cruciate ligament. Best viewed when digitally zoomed in.
  • ...and 2 more figures