A comprehensive survey on deep active learning in medical image analysis

Haoran Wang; Qiuye Jin; Shiman Li; Siyu Liu; Manning Wang; Zhijian Song

A comprehensive survey on deep active learning in medical image analysis

Haoran Wang, Qiuye Jin, Shiman Li, Siyu Liu, Manning Wang, Zhijian Song

TL;DR

The paper addresses the high annotation cost in medical image analysis by surveying deep active learning (AL) methods, especially their core components—informativeness evaluation and sampling strategies—and their integration with semi-supervised, self-supervised, and other label-efficient techniques. It introduces a comprehensive taxonomy of uncertainty- and representativeness-based AL approaches, discusses region-based and generative methods, and highlights practical considerations for medical tasks such as classification, segmentation, and reconstruction. The authors provide empirical evaluations across multiple datasets, demonstrating when uncertainty and representativeness strategies excel and how budget size shifts performance, while also sharing data and code to promote reproducibility. Finally, the survey outlines challenges and future directions, including uncertainty calibration, better representativeness metrics, weak annotations, advanced generative models, and foundation-model integration, underscoring the potential for impactful, cost-reducing AL methods in clinical practice.

Abstract

Deep learning has achieved widespread success in medical image analysis, leading to an increasing demand for large-scale expert-annotated medical image datasets. Yet, the high cost of annotating medical images severely hampers the development of deep learning in this field. To reduce annotation costs, active learning aims to select the most informative samples for annotation and train high-performance models with as few labeled samples as possible. In this survey, we review the core methods of active learning, including the evaluation of informativeness and sampling strategy. For the first time, we provide a detailed summary of the integration of active learning with other label-efficient techniques, such as semi-supervised, self-supervised learning, and so on. We also summarize active learning works that are specifically tailored to medical image analysis. Additionally, we conduct a thorough comparative analysis of the performance of different AL methods in medical image analysis with experiments. In the end, we offer our perspectives on the future trends and challenges of active learning and its applications in medical image analysis.

A comprehensive survey on deep active learning in medical image analysis

TL;DR

Abstract

Paper Structure (64 sections, 13 equations, 6 figures, 13 tables)

This paper contains 64 sections, 13 equations, 6 figures, 13 tables.

Introduction
Problem Settings and Formulations of Active Learning
Core Methods of Active Learning
Evaluation of Informativeness: Uncertainty
Uncertainty with Multiple Inferences
Gradient-based Uncertainty
Performance Estimation
Uncertainty-aware Model
Adversarial-based Uncertainty
Evaluation of Informativeness: Representativeness
Clustering-based Active Learning
Cover-based Active Learning
Discrepancy-based Active Learning
Density-based Active Learning
Sampling Strategy
...and 49 more sections

Figures (6)

Figure 1: Illustration of the process of active learning.
Figure 2: Overall framework of this survey.
Figure 3: The taxonomy of uncertainty-based active learning.
Figure 4: The taxonomy of representativeness-based active learning.
Figure 5: The taxonomy of different sampling strategies in active learning.
...and 1 more figures

A comprehensive survey on deep active learning in medical image analysis

TL;DR

Abstract

A comprehensive survey on deep active learning in medical image analysis

Authors

TL;DR

Abstract

Table of Contents

Figures (6)