Label-Efficient Deep Learning in Medical Image Analysis: Challenges and Future Directions
Cheng Jin, Zhengrui Guo, Yi Lin, Luyang Luo, Hao Chen
TL;DR
This survey organizes label-efficient learning in medical image analysis into four annotation scenarios (no label, insufficient label, inexact label, and label refinement) and surveys 350+ studies across imaging modalities. It foregrounds how self-supervised, semi-supervised, weakly supervised, and active learning strategies can reduce annotation burdens, with HFMs playing a central role in transfer and pretraining. The authors synthesize representative methods (reconstruction, context, contrastive, MIL, pseudo-labeling, generative modeling, and regularization) and discuss challenges in generalization, benchmarking, and clinical deployment. They also outline future directions, including health foundation models, HITL, generative augmentation, federated learning, and standardized evaluation pipelines to accelerate translation to clinical practice.
Abstract
Deep learning has significantly advanced medical imaging analysis (MIA), achieving state-of-the-art performance across diverse clinical tasks. However, its success largely depends on large-scale, high-quality labeled datasets, which are costly and time-consuming to obtain due to the need for expert annotation. To mitigate this limitation, label-efficient deep learning methods have emerged to improve model performance under limited supervision by leveraging labeled, unlabeled, and weakly labeled data. In this survey, we systematically review over 350 peer-reviewed studies and present a comprehensive taxonomy of label-efficient learning methods in MIA. These methods are categorized into four labeling paradigms: no label, insufficient label, inexact label, and label refinement. For each category, we analyze representative techniques across imaging modalities and clinical applications, highlighting shared methodological principles and task-specific adaptations. We also examine the growing role of health foundation models (HFMs) in enabling label-efficient learning through large-scale pre-training and transfer learning, enhancing the use of limited annotations in downstream tasks. Finally, we identify current challenges and future directions to facilitate the translation of label-efficient learning from research promise to everyday clinical care.
